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Harassment on China’s campuses 


A string of cases at Chinese universities shows a system that is trying to address concerns and 


implement change, but still has some way to go. 


her own life after making allegations that she had been sexually 

harassed and raped by a professor. Her case made little impact 
at the time, but it is doing so now. The issues it raises highlight two 
points about how the #MeToo movement is now playing out on the 
campuses of Chinese universities. It shows the extent to which things 
have changed, and underlines ways in which these changes do not yet 
go far enough. 

The student was called Gao Yan. When friends and supporters last 
month highlighted the anniversary of her death, Peking University 
confirmed that an investigation at the time had criticized Shen Yang, a 
famed linguist at the university, for having an inappropriate six-month 
relationship with her, which he ended nine months before her suicide. 
Shen could not be reached by Nature. He told Chinese media the accu- 
sations against him were untrue. After the case was made public last 
month, he was fired from his post at Shanghai Normal University. 

Peking University, which Shen left in 2011, also posted previously 
unreleased documents disclosing some details of its 1998 investi- 
gation into the case. And it published statements that noted recent 
efforts, including the introduction in 2016 of a ‘teacher’s handbook, 
to reinforce the ethics of its professors and draft regulations concern- 
ing sexual harassment on campus (currently under consideration). 
The institution deserves some credit for these efforts, albeit two 
decades on. There are many other universities — in China and else- 
where — that stick their heads deep in the sand when controversy 
arises, on sexual harassment, scientific misconduct or other matters. 

That the university felt the need to respond to public pressure at 
all, never mind to issue statements and attempts at reassurance over 
a historic case, gives some indication of how things are changing for 
the better in China. Awareness of harassment, and intolerance for 
harassers, is certainly on the rise there, as in many places. A string of 
well-publicized sexual-harassment cases has hit university campuses 
in China in recent months, and senior academics accused of improper 
behaviour have lost their positions or faced other sanctions. 

The shift goes beyond academia, too. Some technology companies 
have been forced to apologize for discriminatory hiring policies that 
target attractive women, and for advertisements that boast about the 
beauty of their female employees. 

Together, the public airing of these cases is a positive development. 
Unsavoury activity is being exposed and, to some extent, those found 
guilty are receiving penalties that might deter others from similar 
behaviour. This could signal that China is reaching a new stage of 
transparency, where such issues can be discussed and sexual harass- 
ment will no longer be tolerated. But there are still many reasons to 
be concerned. 

Following the response from Peking University over the Gao Yan 
case, a group of current students there pressed the university for 
more details. Since then, one of them — Yue Xin — has complained 
on social media that university officials have pressured her to stop 


S ome 20 years ago, a student at Peking University in Beijing took 


asking for the information. Her allegations have made headlines and 
yielded statements of support from around the world. They have 
partly overshadowed official celebrations of the university’s 120-year 
anniversary this month. And, in response, the university’s newfound 
openness is faltering — numerous attempts to contact the institution 
have gone unanswered. Meanwhile, students say that posters they 

have put up around campus voicing support 


“Awareness of for Yue have been quickly removed. 

harassment, Chinese President Xi Jinping visited Peking 
andintolerance University last week. Ironically, he praised it as 
forharassers,is _ the birthplace of the 1919 May Fourth move- 
certainly onthe ment, a series of student protests that triggered 


rise.” wide social and political unrest and ultimately 
produced the nation’s communist leaders. 

Concerns over the response to the student protests must be 
addressed. It is one thing for universities to state that sexual harass- 
ment will not be tolerated. It is another entirely for them to buy into 
the kind of wholesale changes in regulations, behaviour and attitude 
that are required. The #MeToo movement is growing into an irresist- 
ible force. Now is not the time for universities — in China or else- 
where — to act like immovable objects. m 


Crowd scene 


Satisfying symmetry between an unusual test of 
quantum physics and peer review of its findings. 


ednesday, 30 November 2016, saw crowds of Cubans line 
Wi streets of Havana to say a final farewell to Fidel Castro. 

On the same day, fans gathered to watch Spanish football 
minnows Hercules hold giants Barcelona to a surprise one-all draw in 
their Copa del Rey tie. Anda crowd of 100,000 people around the world 
came together over an online video game to type ina series of Os and 
1s as fast as their fingers could fly, to put to the test a central feature of 
quantum mechanics. 

The gamers were part of a day-long experiment called 
the BIG Bell Test, the results of which are described on page 212 (The 
BIG Bell Test Collaboration. Nature 557, 212-216; 2018). The findings 
were produced in an unusual way and, with the symmetry that one 
would expect from a beautiful physics theory, they were reviewed for 
publication in an unusual way, too. 

Less than a year after the test, the BIG Bell Test paper landed for 
review from Nature on the desk of Sabrina Maniscalco, a physicist at 
the University of Turku in Finland. Seeing the crowdsourced public 
input it contained, Maniscalco decided to crowdsource the requested 
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review — well, up to a point. She showed it to seven of her senior PhD 
students, and then invited them to discuss it with her over a day of brain- 
storming, literature search and pizza. She channelled the resulting com- 
ments and criticisms into the referee's report that she submitted, and 
which all seven students saw again before submission. No confidence 
was breached during the review process: the students agreed to strict 
confidentiality, and Maniscalco opted in to our referee-accreditation trial 
scheme, which means that her name, together with those of the partici- 
pating students, appears in a statement at the end of the research paper. 
Two other reviewers offered their thoughts on the paper in the usual way. 

It is not unusual for a lab head to delegate a paper's peer review toa 
junior colleague, or to collaborate with one — and, unfortunately, the 
colleague's contribution isn’t always acknowledged. It is unusual, as far 
as we know, to make peer review a team exercise and to openly state 
that that was how it was done. It’s not an experiment we expect will be 
repeated often. Still, contrary to popular belief, our editors are, in prin- 
ciple, happy for referees to involve others in the review process, as long 
as confidentiality is assured and the editors are kept in the loop. It’s right 
for all those involved in peer review to be acknowledged. At its best, such 
collaboration can enrich the review process and help junior researchers 
develop the skills needed to become effective referees themselves. 

In this case, the paper explores the tension between quantum 
physics and local realism. The latter brings together two principles: 
locality — according to which, observing a particle at one physical 
location cannot have immediate effects on the properties of a particle 
at a different location — and realism, which expresses how the observ- 
able features of particles exist even if we dont actively measure them. 
But in quantum mechanics, correlations between distant particles 
exist that are so strong they violate local realism. Put differently, in 
quantum theory it is possible to have two correlated particles far away 
from each other, to measure the first and, as a result, learn something 


about the second without having observed it directly. 

So here's the conundrum: does quantum mechanics really violate 
local realism, or could it be the case that some unknown factors would 
complete the theory and explain these apparent violations? In the 1960s, 
the physicist John Bell offered a way to tackle the problem in the labora- 
tory, by studying quantum correlations in the form of entanglement. 

In these experiments, sequences of spatially 


“The BIG Bell separated measurements on entangled par- 
Test closes ticles lead to computing a quantity that can 
the ‘freedom- have values not possible in the context oflocal 


and realistic theories. Bell tests have con- 
firmed the validity of quantum theory many 
times, but they include assumptions that leave 
wiggle room for non-quantum explanations as to why local realism is 
violated, and so physicists have been looking for ways to close these 
loopholes ever since. 

In 2015, physicists showed that successful Bell tests could not be due 
to speed-of-light communication between the particles, or to inefficient 
detection processes during the measurements. But another, more subtle, 
loophole was still open. Bell tests also assume that experimenters have 
free choice over which measurements they make on each particle. And 
yet, hidden parameters could be influencing these choices to produce 
correlations that give the illusion of entanglement. 

The BIG Bell Test closes this freedom-of-choice loophole. The various 
experimental groups had no say in which measurement settings to 
use. Instead, they performed their measurements according to the 
unpredictable streams of bits received from the 100,000 gamers. 

The results show the presence of correlations strong enough to 
contradict local realism. Maybe that’s how 30 November 2016 might 
be remembered: the day the people of the world came together to test 
quantum theory. = 


of-choice’ 
loophole.” 


Helping hands 


Awareness of mental health must be matched 
with steps such as better training for supervisors. 


week is Mental Health Awareness Week in the United Kingdom. 
And awareness is certainly on the rise. Already this year, reports, 
surveys and studies have highlighted psychological struggles experi- 
enced by the old, the young, schoolchildren, men, women, soldiers, 
immigrants and refugees, football players, dancers, actors, social-media 
users, musicians, and elite athletes immediately after the Olympic 
Games. As quoted often, one in four people have a mental-health con- 
dition. And rates of depression and anxiety reported by postgradu- 
ate students are unacceptably high. This week, Nature is working to 
improve awareness of how mental illness can affect researchers: in this 
issue’s Careers (page 267) and Comment (page 160) sections, several 
scientists share their experiences with honesty and admirable courage. 
Awareness on its own is obviously not enough. John Lennon wrote 
that life is what happens while you're busy making other plans. Well, a 
life with mental illness can feel a lot like something that happens while 
well-meaning people are busy raising awareness. So, how do we make 
sure that those affected actually feel heard, supported and better? 
Nature is trying to play a small part. Last month, we received a stag- 
gering response from readers to a Careers item about the alarmingly 
high rates of mental-health concerns reported by postgraduates. We 
invited people to tell us their stories, which we collected through a con- 
fidential online form. Our editors hoped to find some examples of suc- 
cess that we could share. Yet, almost without exception, the outpouring 
of 300-plus stories we received were from people who wanted support 


I: Mental Health Awareness Month in the United States, and next 
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but were getting little, if any. (Of course, those who have found support 
might be less likely to tell their tale.) We printed five respondents’ sto- 
ries, with their kind permission, last week (Nature 557, 129-131; 2018). 

Most who replied were postgraduate students and postdocs, but 
several established scientists also wrote in to point out that mental- 
health problems are not confined to the young. We want to thank all 
those who responded so openly: it was harrowing reading, and will help 
to drive our future coverage of these issues. As many people struggling 
with their mental health eventually realize, it rarely helps to keep quiet. 
Reach out to someone and you'll probably be surprised at how readily 
they acknowledge what you're going through. Perhaps they are, too. 

One problem is that, according to a report last year by RAND 
Europe, “the evidence around the effectiveness of interventions to 
support the mental health of researchers specifically is thin. Few inter- 
ventions are described in the literature and even fewer of those have 
been evaluated” (see go.nature.com/2juanaw). 

Some efforts are already under way to help postgraduates, beyond 
simply raising awareness. In March, the Higher Education Funding 
Council for England announced it would put a total of £1.5 million 
(US$2 million) towards improving mental health at 17 universities. 
Several schemes will endeavour to better train and equip PhD supervi- 
sors to mentor their students. This is much needed: a dysfunctional 
supervisor-student relationship was acommon complaint from many 
who wrote to us. 

Manipal Academy of Higher Education in India has established an 
independent, confidential student-support centre so that students can 
come directly to ask professional psychologists for immediate help. 
Meanwhile, the Francis Crick Institute in London has more than 20 reg- 
istered mental-health first aiders, who are trained to recognize mental- 
health issues, provide initial help and guide people towards professional 
services where appropriate. These and other examples of good practice 
and sources of support are collated in a dedicated page on our website 
(see go.nature.com/2i9a6yx). We hope they inspire more. = 
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assault have emerged across the spectrum of science. Nature has 
already run several stories on the topic just this quarter. 

When I talk to senior scientists, many view harassment as an injustice 
that happens somewhere else, not in their field or at their institution. 
But data suggest that the problem is ubiquitous. In separate surveys of 
tens of thousands of university students across Australia, the United 
Kingdom and the United States, upwards of 40% of respondents say 
that they have experienced sexual harassment. A survey last year by the 
US National Postdoctoral Association found that 28% of respondents 
reported experiencing at least one instance of harassment while they 
were trainees; offenders were predominantly reported as being faculty 
or staff members (go.nature.com/2ju83ox). Neither are faculty members 
safe from mistreatment by colleagues. 

Research culture and policies are quick to 
denounce plagiarism, data fabrication and mis- 
management of funds, yet we have too long 
ignored the mistreatment of people. 

Science is a social endeavour; ignoring 
harassment perpetuates a culture in which 
people who experience or witness hostile behav- 
iours are afraid to speak up, cannot do their best 
work, or leave science altogether. Last September, 
the American Geophysical Union (AGU) defined 
harassment, bullying and discrimination as scien- 
tific misconduct, and outlined consequences. The 
greater scientific community should do the same. 

My colleagues and I are developing a pro- 
gramme to reduce harassment in the geosciences 
— one of the least diverse scientific disciplines — supported by the US 
National Science Foundation (NSF). The project, ADVANCEGeo, 
equips bystanders to respond to and prevent harassment in the field, 
lab, office and at conferences, and advocates for inclusion of the subject 
in courses on ethical research conduct. 

Our efforts are part of a growing movement. Just this year, both the 
US NSE and the UK Wellcome Trust put forward policies targeting 
harassment by grant recipients. Many scientific societies have codes of 
conduct that specifically prohibit harassment at meetings, and some, 
such as the American Astronomical Society and the Geological Soci- 
ety of America, address the professional treatment of others in their 
general codes of ethics. 

Harassment, bullying and discrimination damage science at the 
individual, community, institutional and societal levels. The behav- 
iours cause health problems, fear, mistrust, depression and trauma. 
That results in decreased productivity and the exclusion of people who 
might have led highly satisfying scientific careers and made important 
contributions. 

Less-represented populations are disproportionately affected. A 2017 
study in astronomy and planetary science found that women of colour 
were more likely than other groups to report skipping professional 


LE the past year, allegations of egregious sexual harassment and even 


IMPROVED ETHICS 
TRAINING SHOULD 


LAY BARE 
POWER DYNAMICS 
AND BEHAVIOURS 

THAT ALLOW 


HARASSMENT. 


A personal take on events 


Harassment should count 
as scientific misconduct 


Scientific integrity needs to apply to how researchers treat people, not just to 
how they handle data, says Erika Marin-Spiotta. 


events because they felt unsafe (K. B. H. Clancy et al. J. Geophys. Res. 
Planets 122, 1610-1623; 2017). Ina2016 survey of physicists identifying 
as lesbian, gay, bisexual, transgender or queer (LGBTQ), one-third of 
respondents reported that they had considered leaving their institutions; 
this group was also more likely to have experienced or witnessed hostile 
behaviours (see go.nature.com/2wczfih). Other groups are also likely to 
be vulnerable, but data are sparse. 

Why aren't the laws already in place against harassment sufficient? 
People cannot count on their enforcement, especially given that imbal- 
ances of power in academia favour the perpetrators. Fear of retaliation 
also keeps people from reporting to employers in the first place. 

Defining harassment as misconduct provides more ways of deter- 
ring it. For example, the AGU has developed processes for investigating 
allegations, including for addressing concerns that 
might not rise to the level of a formal complaint. 
Sanctions might include being barred from meet- 
ings or publishing in society journals, or the denial 
of an award. To be clear, when talking about har- 
assment, I am not referring to socially awkward 
interactions but to well-defined and documented 
behaviours — such as unwanted groping and 
requests for sexual favours — that create a hostile 
work environment. 

A better sense of how and how often harass- 
ment happens in science would help to convince 
the community ofits pervasiveness, and counter 
mistaken beliefs that it is not common in the 
workplace. We also need more data on the 
experiences of people from under-represented 
groups, and on how hostile climates affect efforts to maintain diversity. 

We must craft effective interventions. Courses in responsible 
research conduct, currently a requirement for trainees, should cover 
how to prevent and respond to harassment and bullying. Institu- 
tions should offer in-person training, including discipline-specific 
scenarios that people can relate to. Departments should commu- 
nicate their workplace values publicly. This is starting to happen. 
For example, the geosciences department at Middle Tennessee 
State University in Murfreesboro has crafted a code of conduct that 
describes principles and practices for professional behaviour, adapted 
from the AGU’. Scientific societies have also made plans to offer 
bystander-intervention training at their conferences. 

Improved ethics training across career levels should draw from 
the social and behavioural sciences to lay bare the power dynamics 
and behaviours that allow harassment, bullying, racism and sexism 
to persist in the sciences. The integrity of the scientific enterprise 
demands that we stop tolerating such behaviours. m 


Erika Marin-Spiotta heads the Biogeography and Biogeochemistry 
Research Group at the University of Wisconsin-Madison, USA. 
e-mail: advancegeopartnership@gmail.com 


10 MAY 2018 | VOL 557 | NATURE | 141 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


SEVEN DAYS nescniss 


Scientist detained 


A British-Iranian professor 

is being detained in Iran 
following his arrest during 

a trip to attend an academic 
workshop in Tehran, 
according to Iranian media 
reports. Abbas Edalat, a 
computer scientist and 
mathematician at Imperial 
College London, was detained 
by Iran’s Revolutionary Guards 
in April for his alleged role 

in spying activities. Reports 
suggest that the authorities 
confiscated his computer, 
notebooks and CDs. Edalat 
founded a pro-peace 
organization, the Campaign 
Against Sanctions and Military 
Intervention in Iran. Imperial 
College says it is “concerned 
for his welfare’, and is urgently 
seeking further information 
about the incident. 


PNAS resignation 


On 1 May, Inder Verma, 

a cancer researcher at the 
Salk Institute for Biological 
Sciences in La Jolla, California, 
resigned as editor-in-chief 
of the journal Proceedings 

of the National Academy of 
Sciences. The move comes 
after the publication of an 
investigation by Science, 

in which several female 
researchers who were either 
at the institute or had ties to 
it between 1976 and 2016 
allege that Verma harassed 
them. Verma, who served on 
powerful committees at the 
institute, vehemently denied 
the allegations in a statement 
to Nature. The Salk Institute 
suspended him on 21 April 
while it investigates the claims. 


Biologist sacked 
The University of Tokyo 
announced on 27 April that 

it has dismissed cell biologist 
Yoshinori Watanabe. The 
decision followed a university 
investigation that found 
manipulated images in five of 


Deadly dust storms hit India 


Severe dust storms occurring in the lead up to 
the monsoon season have killed more than 100 
people in the Indian states of Uttar Pradesh and 
Rajasthan in the past week. Most deaths occurred 
when buildings collapsed in unusually intense 
winds that approached speeds of 130 kilometres 
an hour. On 3 May, India’s Meteorological 


Department said that the storms followed 
‘intense heating’ over the plains of northwestern 
India, with temperatures in the region reportedly 
reaching 45-50°C. Meteorologists say that high 
temperatures might have intensified the storms. 
The high fatality rate was probably due to the 
storms striking while people slept. 


Watanabe’s papers, which the 
institution said amounted to 
scientific misconduct. One 
paper has been retracted, 

and two others have been 
corrected. During his career, 
Watanabe made substantial 
contributions to scientists’ 
understanding of how cells 
divide. Watanabe says that the 
problems that were identified 
in the five papers did not affect 
the studies’ conclusions. He 
says that he had no intention 
of misleading others, and 
believes he did not commit 
misconduct. Watanabe told 
Nature that he resigned on 

28 February, before his official 
dismissal. He will now spend 
a year retraining in data 
acquisition and presentation 
at the laboratory of Nobel 
prizewinner Paul Nurse, at 
the Francis Crick Institute in 
London. 
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Apology for ritual 
The Chinese Academy of 
Sciences has apologized for 

a Taoist ceremony organized 
by construction workers 

that included the slaughter 

of a lamb to inaugurate the 
construction of a nuclear 

test reactor. Images showing 
a monk chanting while the 
lamb was killed sparked 
online controversy in China. 
The April ceremony marked 
the start of construction ofa 
thorium molten salt reactor 
in north-central China, 

part of a 22-billion yuan 
(US$3.5-billion) project to 

be operated by the academy's 
Shanghai Institute of Applied 
Physics. In a statement posted 
on its homepage on 30 April, 
the academy said it apologized 
because the ritual was “against 


the spirit of science” and that 
institute staff did not know the 
ritual was planned. 


US biobank begins 


An ambitious project to track 
the health of one million 
people in the United States 
for at least ten years will 

start enrolling participants 
on 6 May. The US National 
Institutes of Health initiative 
— announced by former 
president Barack Obama 
during his 2015 State of the 
Union address — aims to 
recruit volunteers who range 
in geographical locale, ethnic 
background, health conditions 
and age. The biological 
information that researchers 
will collect from participants 
include blood samples, 
biometric data from devices 
such as Fitbits, and medical 
records. The project will 


HINDUSTAN TIMES/GETTY 


IAN ALDERMAN 


SOURCE: R. ROSENBERG ET AL. MORB. MORTAL. WKLY REP. 67, 496-501 (2018). 


accept only people over age 18 
initially, but plans to include 
children in the future. 


Day Zero delayed 


Cape Town, South Africa, has 
averted Day Zero — the date 
officials predicted the city’s 
reservoirs would run too low 
to deliver potable water to 

the metropolis. Citizens have 
saved the city — for now — by 
reducing their water usage 
from 1.2 billion litres a day in 
late 2016 to 450 million litres a 
day this year, said Helen Zille, 
Cape Town's former mayor, 
on 3 May. She cautioned that 
the threat could return next 
year, and that its timing would 
depend on how much rain the 
region receives in August. 


PUBLISHING 


Nature’s new editor 
Geneticist Magdalena 

Skipper will be the next 
editor-in-chief of Nature, 

the journal announced on 

2 May. Skipper (pictured), 

who is currently editor of the 
open-access journal Nature 
Communications, willbe 

the first woman to head the 
149-year-old title. She will take 
over from Philip Campbell, 
who has led Nature since 1995, 
on 1 July. Skipper says that 
during her tenure, she wants to 
continue the journal's work to 
ensure that scientific findings 
are reproducible and robust, 
particularly in the age of big 
data. She would also like Nature 


TREND WATCH 


Diseases spread by mosquitoes, 
ticks and fleas are rising in the 
United States, according to the 
US Centers for Disease Control 
and Prevention. Between 

2004 and 2016, total yearly cases 
ofa combination of 16 diseases 
tripled. Tickborne illnesses — 
Lyme disease, in particular — 
made up more than 75% of the 
nearly 643,000 cases during the 
whole period. Mosquito-borne 
infections, such as Zika and 
chikungunya, tend to spike with 
outbreaks, whereas the incidence 
of Lyme disease has risen gradually. 


to focus more on early-career 
researchers. See go.nature. 
com/2hx8woa for more. 


Pollution report 


Air pollution is decreasing 

in Chinese cities, but 
increasing in cities in Pakistan, 
Bangladesh and India. 
According to the latest global 
air-quality report by the 
World Health Organization 
(WHO), released on 2 May, 
cities in those three countries 
have experienced the 

steepest rises in air pollution 
since 2010. In China, the 
average annual levels of 
small, dangerous particles 
known as PM, , (those with a 
diameter of 2.5 micrometres 
or less) have dropped by 17% 
since the 2017 report. This 
reduction was achieved by the 
government enforcing strict 
limits on industrial emissions, 
increasing renewable power 
and increasing access to 
cleaner fuels for indoor heating 


and cooking. But dozens of 
Chinese cities still rank among 
the 100 most polluted, and an 
estimated 2 million people die 
in China every year because 

of air pollution. The WHO 
survey reviewed air-quality 
statistics in 4,300 major 

cities for 2016. It found that 
more than 90% of the world’s 
population breathes unsafe air. 


Carbon hits high 


The average concentration of 
atmospheric carbon dioxide, 
a key greenhouse gas, for the 
month of April exceeded 410 
parts per million, researchers 
announced on 2 May. This 

is the first time in recorded 
history that the monthly CO, 
average has breached this 
threshold. Climate scientists 
took the measurements at the 
Mauna Loa Observatory in 
Hawaii, where they have been 
monitoring atmospheric CO, 
levels since 1958. Average 
global temperature has already 
increased by 1°C from the 
pre-industrial era. Continued 
CO, accumulation in the 
atmosphere puts Earth on 

a path to reaching 2°C of 
warming later this century. 


Mission launch 
NASA launched its Mars 
InSight spacecraft from 
Vandenberg Air Force Base 
in California on 5 May. The 


INSECT-BORNE DISEASES ON THE RISE 


The number of infections from ticks, fleas and mosquitoes tripled in 
the United States between 2004 and 2016. 
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probe will record seismic 
activity on the red planet, 
including tremors known as 
marsquakes. Researchers hope 
to use the information to gauge 
the size of Mars’s core, which 
could teach scientists about 

its density and composition, 
and the boundaries between 
the planet's core, crust and 
mantle. InSight will land in 
Elysium Planitia, a flat area 
near the Martian equator, 

on 26 November 2018. The 
international effort includes a 
seismometer built by a French 
team, and a heat-flow probe 
built by a German group. 


Pp FUNDING 
Climate grants 


Twenty-seven climate scientists 
from around the world are 
heading to work in France 

and Germany, as part of the 
second round of French 
President Emmanuel Macron’s 
Make Our Planet Great 
initiative. Macron launched the 
€60-million (US$70-million) 
grant scheme last June in 
response to US President 
Donald Trump deciding to 
pull out of the Paris climate 
accord. Germany joined the 
grant scheme last September 
and added €15 million in 
government funding, which 
was matched by participating 
research organizations. The 
joint programme offers 4-year 
grants of up to €1.5 million 

for senior scientists and up 

to €1 million for early-career 
researchers. The second-round 
winners were announced on 

2 May; 14 will work in France 
and 13 in Germany. Thirteen of 
the scientists currently work in 
the United States. 


European science 
The European Commission 
has proposed a budget of 
€100 billion (US$120 billion) 
for its next major research- 
funding programme. Named 
Horizon Europe, it will last 
from 2021 to 2027, and follows 
the current programme, 
Horizon 2020. See page 150 
for more. 
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CERN 


NEWSIN FO 


Yellowstone 
grizzlies could be hunted 
again p.148 
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The pixel detector at CERN’s CMS experiment records particles that emerge from collisions. 


PARTICLE PHYSICS 


Physicists turn to AI to cope 
with CERN collision deluge 


Competition with cash rewards seeks better techniques for tracking particle debris. 


BY DAVIDE CASTELVECCHI 


hysicists at the world’s leading atom 
Pp smasher are calling for help. In the next 

decade, they plan to produce up to 
20 times more particle collisions in the Large 
Hadron Collider (LHC) than they do now, 
but current detector systems arent fit for the 
coming deluge. So last week, a group of LHC 
physicists teamed up with computer scientists to 
launch a competition to spur the development 


of artificial-intelligence (AI) techniques that can 
quickly sort through the debris of these colli- 
sions. Researchers hope that these methods will 
help the experiment’s ultimate goal of revealing 
fundamental insights into the laws of nature. 
At the LHC at CERN, Europe’s particle- 
physics laboratory near Geneva, Switzerland, 
two bunches of protons collide head-on inside 
each of the machine’s detectors 40 million 
times a second. Every proton collision can pro- 
duce thousands of new particles, which radiate 


from a collision point at the centre of each 
cathedral-sized detector. Millions of silicon 
sensors are arranged in onion-like layers and 
light up each time a particle crosses them, 
producing one pixel of information every 
time. Collisions are recorded only when they 
produce potentially interesting by-products. 
When they are, the detector takes a snapshot 
that might include hundreds of thousands of 
pixels from the piled-up debris of up to 20 dif- 
ferent pairs of protons. (Because particles 
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> move at or close to the speed of light, a 
detector cannot record a full movie of their 
motion.) 

From this mess, the LHC’s computers 
reconstruct tens of thousands of tracks in 
real time, before moving on to the next 
snapshot. “The name of the game is con- 
necting the dots,” says Jean-Roch Vlimant, 
a physicist at the California Institute of 
Technology in Pasadena who is a member 
of the collaboration that operates the CMS 
detector at the LHC. 

After future planned upgrades, each 
snapshot will be expected to include par- 
ticle debris from 200 proton collisions. 
Physicists currently use pattern-recogni- 
tion algorithms to reconstruct the particles’ 
tracks. Although these techniques would 
be able to work out the paths even after the 
upgrades, “the problem is, they are too slow’, 
says Cécile Germain, a computer scientist 
at the University of Paris South in Orsay. 
Without major investment in new detector 
technologies, LHC physicists estimate, the 
collision rates will exceed the current capa- 
bilities by at least a factor of ten. 

Researchers suspect that machine-learn- 
ing algorithms could reconstruct the tracks 
more quickly. To help find the best solution, 
Vlimant and other LHC physicists teamed 
up with computer scientists, including 
Germain, to launch the TrackML challenge. 
For the next 3 months, data scientists will be 
able to download 400 gigabytes of simulated 
particle-collision data — the pixels pro- 
duced by an idealized detector — and train 
their algorithms to reconstruct the tracks. 

Participants will be evaluated on the 
accuracy with which they do this. The top 
three performers will receive cash prizes of 
US$12,000, $8,000 and $5,000. 


PRIZE APPEAL 
Such competitions have a long tradition in 
data science, and many young researchers 
take part to build up their CVs. “Getting 
well ranked in challenges is extremely 
important,” says Germain. In a similar 
competition in 2014, teams competed to 
‘discover’ the Higgs boson in a set of simu- 
lated data (the LHC discovered the Higgs, 
long predicted by theory, in 2012). 
TrackML is “incomparably more 
difficult’, says Germain. She thinks the 
winning technique might end up resem- 
bling those used by the program AlphaGo, 
which made history in 2016 when it beat a 
human champion at the complex game of 
Go. In particular, the methods might use 
reinforcement learning, in which an algo- 
rithm learns by trial and error on the basis of 
‘rewards that it receives after each attempt. 
Vlimant and other physicists are also 
beginning to consider more untested tech- 
nologies, such as quantum computing. “It’s 
not clear where we're going,’ says Vlimant, 
“but it looks like we have a good path” m 
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Grizzly bears in the greater Yellowstone ecosystem could be targeted by hunters as early as September. 


CONSERVATION 


Hunting proposal 
might threaten bears 


Biologists argue that the plan could endanger grizzlies in 
iconic ecosystem that includes Yellowstone National Park. 


BY GIORGIA GUGLIELMI 


n 23 May, Wyoming officials will vote 
() on whether to allow the hunting of up 

to 24 grizzly bears around Yellowstone 
National Park this September. The proposal 
has reignited controversy over whether or not 
this population has recovered from decades 
of hunting and habitat destruction — an issue 
central to the US government's decision to take 
the bears in the greater Yellowstone ecosystem 
off the endangered-species list in 2017. 

Seventy-three scientists sent a letter to 
Wyoming Governor Matt Mead on 25 April, 
asking him to halt the hunt until independent 
experts can review data on the size of the grizzly 
(Ursos arctos horribilis) population in this area. 
They are concerned that government tallies 
overestimate the number of bears in this region, 
which spans roughly 80,000 square kilometres 
and is one of the largest continuous wilderness 
areas in the contiguous United States. 

Critics challenge the federal government's 
methods of assessing whether the grizzly 
population is large enough to face a hunt- 
ing season. The estimates might be too high 
because of several factors, says David Mattson, 
a wildlife researcher in Livingston, Montana, 
who retired from the US Geological Survey 
(USGS) in 2013. These include increased 


monitoring efforts in the past 30 years and bet- 
ter visibility of bears to aerial surveys because 
of shifts in where they look for food. 

Wildlife scientist Frank van Manen, who 
leads the USGS Interagency Grizzly Bear 
Study Team (IGBST) in Bozeman, Montana, 
disagrees with critics of the government $ 
estimates. The IGBST collects population 
data using a range of methods, including aerial 
surveys and tagging bears, van Manen says, 
and the numbers from each method agree. 
The current population estimate of 718 bears 
is “extremely conservative’, he says. 

The Wyoming Game and Fish Department 
proposed the hunt in February on the basis 
of those population assessments, and gave 
the public until 30 April to comment on draft 
regulations. Under the proposal, hunters could 
kill up to 12 bears in the monitored region 
surrounding Yellowstone National Park, and 
a further 12 bears outside that area (but still in 
the greater Yellowstone ecosystem). 

Mattson and the other researchers who 
wrote to the governor about the hunt listed 
several concerns in their letter. Some of 
the bear’s food, such as cutthroat trout 
(Oncorhynchus clarkii), will probably become 
even scarcer in the future as a result of environ- 
mental changes, they say. This would push the 
animals to hunt livestock or look for food near 
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houses, increasing their run-ins with people, 
says Mattson. If the number of bears killed as 
a result of these conflicts increases, this would 
further shrink the population. 

Even if the current population estimates 
are accurate, removing 24 animals through 
hunting could have detrimental effects, 
says Andrea Santarsiere, an attorney at the 
Center for Biological Diversity who is based 
in Victor, Idaho. In 2017, 56 bears died in the 
IGBST monitoring area as a result of natural 
causes or conflicts with people. “If the same 


amount dies this year, we could be looking at 
up to 80 bears removed from the population,” 
Santarsiere says. 

Killing females might pose even higher risks 
to the survival of these grizzlies, she says. The 
Wyoming proposal would allow the killing 
of up to two females in the IGBST monitor- 
ing area, but it doesn’t cap how many females 
hunters can take outside this region. Females 
can carry up to four cubs at a time, Santarsiere 
says, “so killing one female could equal 
removing five bears from the population”. 
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Van Manen says the hunting proposal 
won't pose a risk to the bear population. 
Only two hunters at a time would be allowed 
in the monitoring area, and the hunts would 
stop as soon as two females had been killed, 
he says. 

Wyoming officials seem intent on allowing 
the hunt, says Louisa Willcox, a wildlife activist 
based in Livingston, Montana, who has been in 
contact with Wyoming's wildlife department. 
“It’s extremely unlikely that the scientists’ 
comments will make them pause.” m 


PETAR B/REX/SHUTTERSTOCK 


Wellcome Trust vows to 
pull grants from harassers 


UK charity launches policy to force institutions to report bullying or sexual misconduct. 


BY HOLLY ELSE 


ne of the world’s largest research- 
() funding charities is cracking down on 

harassment and bullying. Scientists 
who have been sanctioned by their institutions 
could lose out on funding from the Wellcome 
Trust, under rules announced on 3 May. 

It is the first major UK research funder to 
institute such a policy; the US National Science 
Foundation introduced similar rules earlier 
this year. 

Wellcome’s policy will come into force for 
new and existing grant applications on 1 June. 
It will apply to anyone already associated 
with a grant, including those whose projects 
are already under way. It gives Wellcome, a 
biomedical-research charity in London, the 
right to withhold funding from a researcher 
or to bar them from applying for future grants. 

The policy also means that sanctions can be 
levied against institutions that fail to disclose 
details of such misconduct, do not investigate 
allegations in a timely and fair manner, or 
take inappropriate action. In extreme circum- 
stances, sanctions could include suspending 
funding from an entire organization. 
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Billionaires, 


“Bullying and harassment are just plain 
wrong,’ says Alyson Fox, director of grants 
at the charity. These behaviours are harmful, 
and therefore affect the research that 
Wellcome funds, she adds. The policy “will 
give organizations notice that we are taking 
this extremely seriously”. 

The Wellcome Trust funded more than 
900 grants, worth a total of more than £1 billion 
(US$1.4 billion), in 2017. 

Under the new guidelines, Wellcome will 
require organizations that receive its grants 
to have clear policies that outline standards of 
acceptable behaviour by staff and procedures 
for responding to allegations of harassment 
and bullying. 

The policy defines bullying as a misuse of 
power that can make people feel vulnerable, 
upset, humiliated, undermined or threatened. 
It says harassment is unwanted physical, verbal 
or non-verbal conduct that has the purpose 
or effect of violating someone else’s dignity, or 
creating an intimidating, hostile, degrading, 
humiliating or offensive environment for them. 

Six types of sanction can be applied to grant 
holders and Wellcome advisory committee 
members whose employers have investigated 


@ Water filter inspired by Alan Turing 


and upheld an allegation of bullying or 
harassment. They include removing researchers 
from grants, and banning them from super- 
vising Wellcome-funded PhD students or 
submitting future grant applications. 

Institutions that do not abide by the policy 
risk being temporarily barred from applying 
for Wellcome grants. In extreme cases, they will 
have existing funded suspended. 

Emma Chapman, an astrophysicist at 
Imperial College London and a member of 
the 1752 group, which lobbies against sexual 
misconduct in higher education, calls the 
harassment policy an “excellent step forward”. 
However, she worries that it could lead univer- 
sities to settle complaints informally to hide 
problems. The requirement to report only 
upheld allegations is understandable, Chap- 
man adds, but it risks missing researchers who 
resign before an investigation is completed. 

Philip Maini, a biological mathematician at 
the University of Oxford, UK, also questions 
how effective the policy will be. “Ifan institution 
has someone bringing in huge amounts of over- 
head and publishing in Nature and Science,’ 
Maini says, “are they really going to take action 
against them if they are a bully? I think not? m 


NATURE PODCAST 


Al and our sense 


butterflies and passes first test go.nature.com/2jgkjxr of place; liquid 
brooding skies @ Nature announces new crystals deliver 
—April’s top editor-in-chief go.nature.com/2hx8woa cargo; and 
science images | @ Report harassment or risk losing [ | depression in 
go.nature. funding, says top UK science funder academia nature. 
com/2hzmtxy go.nature.com/2khaGss com/nature/podcast 


10 MAY 2018 | VOL 557 | NATURE | 149 


Limited, part of Springer Nature. All rights reserved. 


| NEWS | IN FOCUS 


RESEARCH FUNDING 


Europe’s €100-billion science plan 


Budget proposed for European Union’s next big research-funding programme. 


BY INGA VESPER 


r Vhe European Union is planning to spend 
€100 billion (US$120 billion) on its next 
major research-funding programme, 

for 2021 to 2027 — a disappointment to some 

scientists and policy groups who had been 
hoping for up to 60% more. The budget does 
not include a contribution from the United 

Kingdom, whose departure from the bloc in 

2019 is likely to shake up the distribution of 

funds among the remaining 27 EU countries. 

The European Commission issued its open- 
ing budget proposal for Framework Programme 
Nine — newly named Horizon Europe, and the 
successor to the current programme, Hori- 
zon 2020 — on 2 May. The announcement 
marks the start of tough negotiations between 
the European Parliament and the Council of the 
European Union, which comprises government 
representatives from EU nations. 

The proposed €100 billion is an increase on 
the €77-billion pot for Horizon 2020, which 
began in 2014 (see “Europe's science spend- 
ing’). However, a report by influential aca- 
demic and industry experts, published last July, 
had urged a doubling of the budget for the next 
framework programme. 


EUROPE’S SCIENCE SPENDING 


The European Commission has proposed a €100-billion (US$120-billion) budget for Horizon Europe, 
the next instalment of its flagship research-funding programme, which will last from 2021 to 2027. 
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Euratom’s budget is included as part of the framework’s total funding, but its grants are distributed through a separate programme. 


And in March, 13 science and higher- 
education organizations, including the 
European University Association (EUA), 
which represents more than 800 institutions, 
also demanded a €160-billion budget. “The 
increase is good, but it’s not at the level we 
would consider suitable,’ says Enora Bennetot 
Pruvot, deputy director of governance, funding 
and policy at the EUA in Brussels. 

“With the UK leaving the EU, we knew it 


was going to be difficult to get the €160 billion 
we would have liked to see,” says Laura 
Keustermans, senior policy officer at the 
League of European Research Universities in 
Leuven, Belgium. The full proposal for Horizon 
Europe is expected in June. The framework is 
set to include funding for large, multidiscipli- 
nary ‘missions to tackle big societal questions; 
€10 billion has also been earmarked for research 
into food, agriculture and rural development. m 


SOURCE: EUROPEAN COMMISSION 


BIOMEDICINE 


Anti-cancer viruses take off 


Encouraging study results and a handful of clinical trials spur interest in therapy approach. 


BY HEIDI LEDFORD 


announced on 2 May that it would pay 

up to US$1 billion to acquire a company 
that makes cancer-killing viruses. The striking 
show of support for a still-unproven treatment 
is just the latest sign that industry and academ- 
ics are warming to the approach. 

In February, Merck, headquartered in 
Kenilworth, New Jersey, agreed to pay US$394 
million to snatch up an Australian firm work- 
ing on cancer-killing, or ‘oncolytic, viruses. 
And in April, 300 people showed up for the 
oversubscribed International Oncolytic Virus 
Conference in Oxford, UK. When the confer- 
ence launched in the early 2000s, there were 
only about 60 attendees. “They were very small 
meetings for these crazy people working with 
viruses,’ says Jean-Simon Diallo, a molecular 


P harmaceutical giant Johnson & Johnson 
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biologist at the Ottawa Hospital Research 
Institute. “We've really seen a shift.” 

Diallo credits a couple of developments 
with igniting the field. One was a 2015 US 
Food and Drug Administration (FDA) deci- 

sion to approve a 


People with modified herpes 
cancer virus called talimo- 
sometimes gene laherparepvec 
gointo (Imlygic) to treat 
remission after some forms of mela- 
contracting a noma. It was the first 
viral infection. cancer-fighting virus 


to win regulatory sup- 
portin the US market. Another development is 
emerging evidence — largely from animal stud- 
ies — that the viruses might work better when 
administered in concert with therapies called 
checkpoint inhibitors, which boost immune 
responses against tumours. 


“The intersection of these two events has 
really put some spice in the oncolytic-virus 
field,” says Diallo. The checkpoint inhibitors 
in particular turned things around, he adds. 

Researchers have been trying to develop 
cancer-fighting viruses for decades, hoping to 
capitalize on centuries-old observations that 
people with cancer sometimes go into remis- 
sion after contracting a viral infection. That 
has spurred teams to develop a panoply of 
viruses that have passed through the gauntlet 
of a clinical trial. 

Many of these trials have met with little 
success. Even Imlygic fell short of show- 
ing a statistically significant improvement in 
patient survival during a clinical trial (R. H. I. 
Andtbacka et al. J. Clin. Oncol. 33, 2780-2788; 
2015). Still, the results were enough to persuade 
the FDA to approve the therapy for melanomas 
that had resisted other treatments. That study 
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also sparked hope among researchers by show- 
ing that a virus injected directly into one tumour 
could rein in tumours elsewhere in the body. 


DEATH BY ASSOCIATION 

The viruses work by generating an immune 
response. After the virus infects and kills can- 
cer cells, the immune system eliminates the 
virus and ends up also getting rid of the dead 
cancer cells. “The side effect from eliminating 
the virus is that the systemic immune system 
recognizes the cancer cells,” says Tomoki Todo, 
a neurosurgeon at the University of Tokyo. 
“Then it starts to attack even those cancer cells 
that are not infected by the virus” 


Scientists reasoned that bolstering such an 
immune response — for example, by using 
a checkpoint inhibitor — could amplify this 
indirect effect. These inhibitors sometimes 
send cancer into remission for years, but only 
for a fraction of people. 

Studies in mice suggest that combining 
checkpoint inhibitors with cancer-killing 
viruses might boost that percentage. And in 
a small clinical trial involving 21 people with 
advanced melanoma, Imlygic, together with 
a checkpoint inhibitor, significantly shrank 
tumours in 62% of participants and wiped 
them out altogether in 33% (A. Ribas et al. Cell 
170, 1109-1119; 2017). 
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When researchers have combined check- 
point inhibitors with other treatments, they 
have met with mixed success. Some fervently 
anticipated combinations have failed in clinical 
trials. “Could the same thing happen with onc- 
olytic viruses? Sure,’ says Dmitriy Zamarin, 
an oncologist at the Memorial Sloan Kettering 
Cancer Center in New York City. 

But Zamarin and others are cautiously 
optimistic. Many checkpoint-inhibitor com- 
binations target a specific protein, Zamarin 
notes, whereas oncolytic viruses provoke a 
much broader immune response that can tar- 
get cancer in a number of different ways. “That 
gives us some comfort,’ he says. m 


NASA/JPL-CALTECH 


QUANTUM PHYSICS 


Universe’s coolest lab set to 
open up quantum world 


NASA’s Cold Atom Laboratory will allow physicists to play with quantum phenomena 


like never before. 


BY ELIZABETH GIBNEY 


uantum physicists are about to get 

their own playground in space. NASA’ 

Cold Atom Laboratory, scheduled to 
launch to the International Space Station on 
20 May, is set to be the coldest place in the 
known Universe. Researchers will use the lab 
to probe quantum phenomena that would be 
impossible to observe on Earth. 

The US$83-million mission will be used 
to study quantum mechanics on the mac- 
roscopic scale by making a state of matter 
known as a Bose-Einstein condensate (BEC). 
These are clouds of hundreds of thousands of 
atoms that, when chilled to just above abso- 
lute zero, behave as waves that synchronize 
into a single quantum object. “Just being able 
to do these experiments in space I think is a 
huge accomplishment,’ says Kamal Oudrhiri, 
mission manager at the Jet Propulsion Labo- 
ratory (JPL) in Pasadena, California. 

On Earth, gravitational forces usually dis- 
perse these condensates within a few seconds. 
The closest that BECs have come to being in 
space-like conditions is during brief stints in 
a research rocket, or falling from a drop tower 
over 9 seconds. But, floating on the space sta- 
tion, they should be able to exist for at least 
10 seconds. That’s long enough for them to 
be cooled to record-low temperatures — 
perhaps as little as 20 trillionths of a degree 
above absolute zero. That would be the cold- 
est known temperature in the Universe, says 


Oudrhiri. Colder and long-lasting conden- 
sates will “push the frontiers of studying fun- 
damental physics’, says Gretchen Campbell, 
an atomic physicist at the US National Insti- 
tute of Standards and Technology in Gaithers- 
burg, Maryland. “It’s something people have 
hoped for for almost 15 years.” 


DOWN-SIZED KIT 

Real estate on the space station is at a pre- 
mium, so engineers had to crunch down 
atomic-physics equipment that usually fills a 
large room into a cool-box-sized chest. The 
equipment will cool rubidium and potassium 
atoms by scattering laser light off the particles 


in all directions to slow them to almost a 
standstill. It will then use magnetic fields to 
trap the cloud. To create the condensate, other 
cooling techniques are used to push the cloud 
even closer to absolute zero — including 
creaming off the most energetic atoms using 
a radio-wave ‘knife’ and widening the trap to 
let the cloud expand. 

Engineers also had to design shielding to 
protect the delicate condensates from inter- 
ference from densely packed components and 
from Earth’s varying magnetic field. Although 
astronauts will unpack and install the equip- 
ment, the experiments will run only while the 
team is asleep to minimize disruption from > 


Instruments on NASA’s Cold Atom Laboratory will cool atoms to near absolute zero (artist’s concept). 
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> any movements. 

The technology is simpler than initially 
intended, after a more-complex version of the 
lab developed a leak that affected the vacuum 
chamber and threatened to delay the project. 
So physicists will not yet be able to achieve 
their ultimate goal of performing space- 
based atom interferometry — a process that 
involves splitting the condensate’s quantum 
wave in two and recombining the waves. The 
resulting interference patterns would allow 
scientists to analyse the effects of gravity with 
exquisite precision, as well as test whether 
condensates could be used as very sensitive 
rotation and gravity sensors. But the more- 
advanced kit should arrive by the end of 2019, 
says Robert Thompson, project scientist for 
the mission at JPL. 


BUBBLES, RINGS AND WHIRLPOOLS 

The current iteration will still allow new 
physics, says Thompson. Five teams are set 
to experiment with the lab; one plans to use 
radio waves and magnetic fields to trap the 
condensate in a bubble shape about 30 micro- 
metres across — roughly half the width of a 
human hair. Quantum mechanics suggests 
that because the bubble is both thin and 
edgeless, the condensate should behave dif- 
ferently from when it is shaped as a disc or 
sphere on Earth. For instance, it might more 


readily form whirlpools known as vortices, 
says Courtney Lannert, a theoretical physi- 
cist at Smith College in Northampton, Massa- 
chusetts. On Earth, attempts to create bubbles 
always end up with bowl shapes as the fluid 
falls. “We can’t get this shape at all unless we 
can get rid of gravity,” she says. 

A group led by Eric Cornell of the University 
of Colorado, Boulder, who won a Nobel prize 
in 2001 for co-discovering BECs, will try to 

create exotic loosely 


“We can’t get bound systems 
this shape at known as Efimov 
allunless we states. Named after 
can get rid of Russian theoretical 
gravity.” physicist Vitaly Efi- 


mov, who proposed 
their existence in 
1970, these quantum states crop up where 
atoms would bind too weakly to join in pairs, 
but can form threesomes. These are similar 
to Borromean rings — rings linked in such a 
way that the system falls apart if any one ring is 
removed — and are of interest to nuclear phys- 
icists because they have parallels with rare and 
poorly understood three-particle nuclei made 
up of neutrons and protons. The team hopes 
to create the simplest Efimov state but also 
excited, bloated versions in which the atoms 
bind with each other despite being a bacte- 
rium-width apart. The group might also be 


able to make foursomes of such atoms, known 
as tetramers, says Maren Mossman, a physicist 
at Washington State University in Pullman. 

Atomic physicists will find the space-sta- 
tion set-up extraordinary for more-practical 
reasons, says Mossman. They are used to 
building their own equipment and tweaking 
experiments as they go. But with the Cold 
Atom Laboratory, many are sharing a facil- 
ity for the first time, says Mossman, and they 
must run experiments through JPL research- 
ers who are operating the facility from the 
ground. “Folks in particle physics have been 
doing this since the beginning. But it’s so 
weird to us in atomic physics,” she says. 

The process worked out “better than most 
of us expected”, says Thompson, who has 
been working to create such a facility since he 
joined JPL in 1997. He thinks that the current 
version is a step towards even more-complex 
atomic-physics labs in space. NASA is work- 
ing with the German Aerospace Center 
(DLR) to build a facility called BECCAL 
(Bose-Einstein Condensate and Cold Atom 
Laboratory), he says. Many experiments on 
the space station already test the effects of 
low gravity, but for most, the extreme micro- 
gravity is “overkill”, adds Thompson. “We 
are one of the experiments that will really 
highlight what the space station is capable 
of doing.” m 
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AS THE UNITED NATIONS 
PREPARES A HISTORIC 
TREATY TO PROTECT THE 
OCEANS, SCIENTISTS 
HIGHLIGHT WHAT'S 
NEEDED FOR SUCCESS. 


nthe early fifteenth century, Portuguese 

sailors reached a becalmed part of the 

Atlantic Ocean, coated with mats of 

gold-brown seaweed. Under windless 

skies, their ships drifted idly with the 
currents. The sailors named the seaweed Sar- 
gassum — after its resemblance to a Portuguese 
plant — and the region eventually became 
known as the Sargasso Sea. 

Initially thought to be an oceanic desert, 
this part of the Atlantic is now recognized as 
a watery rainforest. It is one of Earth’s most 
rare and valuable marine ecosystems, so rich 
in nutrients that eels travel thousands of kilo- 
metres from rivers in Europe and the Americas 
to breed there. 

But the Sargasso Sea is also one of the dirti- 
est and most damaged parts of the open ocean. 
The gyre of currents that bounds this shoreless 


sea entraps vast amounts of plastic waste, and 
fish stocks are declining in the now-busy ship- 
ping route. 

Scientists want to conserve the Sargasso 
ecosystem, and ten governments have signed a 
non-binding pact to protect it. But their efforts 
are limited owing to a major gap in interna- 
tional policy. Like half of the planet, the Sar- 
gasso Sea doesnt fall under the control of any 
single nation. Countries can protect or exploit 
waters closer than 200 nautical miles (370 
kilometres) to their shorelines, but everything 
outside these ‘exclusive economic zones is con- 
sidered international waters: the high seas. 

The high seas make up two-thirds of Earth's 
oceans, providing 90% of its available habitat 
for life and accounting for up to US$16 billion 
a year in fisheries catch. The oceans are also 
prime territory for the discovery of valuable 
mineral deposits, potent pharmaceuticals 
and oil and gas reserves. The United Nations 
Convention on the Law of the Sea (UNCLOS) 
regulates activities in international waters, 
including sea-bed mining and cable laying; 
a patchwork of 20 or so other organizations 
oversee aspects of international shipping and 
whaling, as well as fishing and conservation at 
the regional level. But no overarching treaty 
exists to protect biodiversity or conserve 
vulnerable ecosystems in the oceans. 

Yet momentum is now building to protect 


Vast expanses of the South 
Pacific, viewed here, are 
unprotected by law. 


the high seas. This September in New York 
City, negotiations begin on a United Nations 
treaty — which is likely to be an add-on to 
UNCLOS — to agree on how to safeguard this 
vast shared resource by setting aside areas for 
conservation and laying out rules for activities 
such as deep-sea mining. The treaty could also 
find ways to help all countries benefit from 
research into deep-sea species — including 
whether marine organisms’ genes and proteins 
might form the basis of new drugs or materi- 
als — either financially or through technology 
transfer. The talks are being heralded as a Paris 
climate accord for the oceans: a vital oppor- 
tunity to conserve the planet's least-explored 
realm. “We have a once in a lifetime chance to 
secure a treaty that will allow nations to manage 
activities on the high seas,’ says Lance Morgan, 
president of the non-profit Marine Conserva- 
tion Institute in Seattle, Washington, which is 
focused on ocean protection. 

The UN, regional fisheries organizations 
and non-profit agencies have already short- 
listed numerous international marine regions 
that — like the Sargasso — deserve protection. 
But researchers are unsure whether politicians 
will heed scientific advice in choosing what 
to protect, and in making judgements about 
environmental impacts. Ahead of the negotia- 
tions, Nature lays out this guide to protecting 
the high seas, and the scientific debates at play. 
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Ocean havens 


There’s no shortage of ideas for marine protected areas (MPAs) on the high 
seas. UN organizations have listed dozens of vulnerable ecosystems, as have 


regional fisheries bodies and non-governmental organizations. This map 
highlights ten sites that showcase the diversity of ecosystems on the high seas 
and the range of threats they face. Data came from the Marine Conservation 


Institute, which has an interactive version at go.nature.com/2hlkked. 


1. Dead zones. Pollutants from agricultural runoff 
can cause plankton blooms in the Bay of Bengal, a 
shallow, warm part of the Indian Ocean. The blooms 
suck up oxygen, leaving dead zones that total at least 
60,000 square kilometres. Further runoff or a change 
in monsoons could cause huge-scale oxygen depletion, 
radically changing an ecosystem that provides jobs and 
food security to more than 100 million people. 


2. First new MPA? East Antarctica, a relatively 
pristine part of the Southern Ocean that is home to 
penguins and corals, is a clear choice for a high-seas 
MPA. But China and Russia have interests in fishing 
krill here; in 2017, it was rejected as an MPA for the 
sixth consecutive year by the Commission for the 
Conservation of Antarctic Marine Living Resources. 


3. Coral crunch. Between the Hawaiian and 
Aleutian islands, a chain of deep-sea volcanoes 
provides nutrient-rich waters for migrating albatrosses, 
whales and tuna. Corals and fish have been hit hard by 
trawling and are struggling to recover. 


4. Shark cafe. Hundreds of great white sharks 
(Carcharodon carcharias) forage and breed here, ina 
region at risk from fishing and shipping. These sharks 
are a genetically distinct population and of higher 
concern even than other great whites; the species as a 
whole could number as few as 3,500 in the wild. 


5. Sea-bed mining. Scattered on and below 
the sea bed are trillions of nodules — potato-sized, 
rock-like deposits rich in many valuable minerals. But 
the region also hosts rare marine species, including 
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HOW TO CORDON OFF THE OCEANS 
A major focus of the treaty will be to agree 
a process to create marine protected areas 
(MPAs) — regions that are off-limits to at least 
some kinds of commercial activity. Established 
properly, MPAs can boost biodiversity in previ- 
ously decimated regions. They can't stop plastics 
entering the ocean, or waters becoming hotter 
or more acidic, but they can increase the size 
and diversity of marine populations, giving 
them a greater resilience to these stressors. 
Scientists say that at least 30% of the global 
ocean, distributed evenly between ocean eco- 
systems, should be cordoned off to avoid a mass 
extinction of marine life. On paper, almost 7% 
of the ocean is now protected: in the past 3 years, 
13 of the world’s largest MPAs, all more than 
100,000 square kilometres in area, have been 
created in coastal waters — largely impelled by 
a UN goal to protect 10% of the ocean by 2020. 
In practice, however, these protections are often 
less than adequate. To be effective, MPAs need 
key traits: they must be ‘no-take} or completely 
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off-limits to commercial activity; have an area 
of at least 100 square kilometres; be permanent 
and physically isolated from their unprotected 
surroundings by deep water or sand; and have 
well-enforced protections. An analysis of 
87 MPAs found that those with only one or two 
of these traits were ecologically indistinguish- 
able from fished sites’. 

Many coastal MPAs allow for oil and gas 
exploration, shipping and fishing. Only 2% of 
the ocean is no-take, and these MPAs are mostly 
in deep tropical waters of little interest to indus- 
try, so do little to reduce overall exploitation of 
the ocean. As for the high seas, just 0.5% is off- 
limits to commercial exploitation. (Much of this 
is due to the largest international MPA, in the 
Ross Sea off Antarctica, which was created by 
a regional 25-nation council). “As is often the 
case closer to shore, there’s a serious risk that 
high-seas MPAs will be sited in areas of low 
commercial interest,” says Elizabeth De Santo, 
an environmental-management specialist at 
Franklin and Marshall College in Lancaster, 


Pennsylvania. How scientific advice on MPAs 
will feed into the UN treaty is yet to be decided. 


MONITORING AND ENFORCEMENT 

Once protected ocean areas have been agreed 
on, it’s crucial to gather baseline data. A 2000-10 
project called the Census of Marine Life pro- 
vided much of what researchers know about life 
in the high seas, but the oceans have become 
warmer, more acidic and more heavily fished 
since then. This need for new data could stimu- 
late a fresh period of discovery. 

Monitoring breaches of protected areas is 
possible thanks to satellite technology. Global 
Fishing Watch (GFW), a satellite-based sur- 
veillance initiative that was launched in 2014 
by the non-profit organizations SkyTruth and 
Oceana, together with Google, allows anyone 
with WiFi access to track fishers in real time, 
for instance. These data suggest that commer- 
cial fishing reaches more than half the ocean, 
spanning an area four times that covered by 
agriculture on land’. 
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a species of ghost octopus that was discovered in 
2016. The International Seabed Authority has issued 
16 contracts to explore the area for minerals. Scientists 
say at least one-third of the zone should be off-limits to 
mining, with controls in place where it is permitted. 


6. Dynamic dome. Strong winds drive currents 
that force cold, nutrient-rich waters to well up from the 
deep to just below the surface. Iconic ocean species 
come here, including mahi-mahi, billfish, sharks, 
squid, cetaceans and endangered sea turtles. But this 
‘thermic dome’ shifts its position, and only seasonally 
occurs on the high seas, so it is challenging to protect. 


7. Marine rainforest. The Sargasso region is one 
of 37 EBSAs, or ‘Ecologically or Biologically Significant 
Marine Areas’ on the high seas. The UN designation 
identifies the regions as important to healthy ocean 
function but does not protect them. 


8. Hydrothermal field. Discovered in 2000, the 
‘Lost City’ system could give clues to the necessary 
precursors for life on Earth. Ata depth of 800 

metres, this acidic, hot ecosystem extends for about 
400 metres along the top of an underwater mountain 
known as the Atlantis Massif. The United Nations 
Educational, Scientific and Cultural Organization has 
proposed a 20-kilometre buffer zone. 


9. Ineffective sanctuary. This refuge, regarded 
as the first ever high-seas MPA, was created in 1999 to 
protect the many cetacean species that visit its waters. 
But the sanctuary lacks management and has had little 
effect. If expanded and implemented properly, it could 
provide refuge for bluefin tuna, sharks and swordfish. 


10. Oil and gas. This 1,800-kilometre mountain 
chain hosts active volcanoes, hydrothermal vents and 
unique creatures such as eyeless shrimp (Rimicaris 
exoculata), which could be vulnerable to shipping and 
oil and gas exploration as the Arctic warms. 


DAVE FLEETHAM/GETTY 


But prosecuting regulatory breaches is a 
political issue — and at the will of indi- 
vidual nations. An analysis of hundreds 
of coastal MPAs has found that staff- 
ing and budgets are the strongest predictors 
of whether an MPA will have a conservation 
impact’. The ecological effects of MPAs with 
enough staff to patrol and monitor activity 
within the reserve were nearly three times 
greater than were those of MPAs with inad- 
equate capacity, researchers found. 


ENVIRONMENTAL ASSESSMENTS 
On land and in coastal waters, new commercial 
activities have to undergo an ‘environmental 
impact assessment or EIA, to weigh up any 
benefits against potential harm to local wildlife. 
On the high seas, only some activities are 
regulated in this way. It wasn’t until 2006 
that bottom trawling — a highly destructive 
fishing method — needed an EIA. Before 
that, it destroyed deep-sea corals. Even now, 
mid-water fisheries, open-water farming and 


JUST 0.5% OF THE HIGH 
SEAS IS OFF-LIMITS TO 
COMMERCIAL EXPLOITATION. 


rocket launches (which dump waste at sea) do 
not need to consider potential environmental 
harm. Scientists want to see new commercial 
activities on the high seas tightly regulated. 
Deep-sea mining, in particular, is likely to 
cause a flashpoint at the UN talks. 

The International Seabed Authority, 
established by UNCLOS, has approved 
29 exploration licences for companies such as 
Lockheed-Martin to do surveys, mostly along 
ridges in the oceans and at hydrothermal vents. 
It is now developing regulations for what EIAs 
mining companies would have to conduct. 

Cindy Van Dover, a deep-sea biologist at 


Duke University in Durham, North Carolina, 
says that scientists are concerned these regula- 
tions will not be strict enough. One unresolved 
issue is whether companies can mine active 
hydrothermal vent sites, which support large, 
diverse biological communities. “We're argu- 
ing that we should protect active hydrother- 
mal vents,” she says. Scientists don’t yet know 
whether these communities can recover from 
mining. “What we've learned from bottom 
trawling is that the recovery time, particularly 
for complex habitats like deep-water corals, 
can potentially be hundreds of years,’ says Paul 
Snelgrove, a deep-sea biologist at the Memo- 
rial University in St Johns, Canada. “I think we 
have to accept that science will not be the only 
deciding factor, but we certainly hope it will 
be one of the major considerations,” he says. = 


Olive Heffernan is a science writer in Dublin. 
1. Edgar, G. J. et al. Nature 506, 216-220 (2014). 


2. Kroodsma, D. A. et al. Science 359, 904-908 (2018). 
3. Gill, D. A. etal. Nature 543, 665-669 (2017). 
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A man waits for treatment at a hospital in Havana, where a drug to treat diabetic foot ulcers was shown to be effective in a clinical trial. 


When will clinical trials 
finally reflect diversity? 


An analysis of drug studies shows that most participants are white, even though trials 
are being done in more countries, reveal Todd C. Knepper and Howard L. McLeod. 


countries contributing to the clinical- 

trial data used by the US Food and 
Drug Administration (FDA) to approve 
drugs has almost doubled (see “Going 
global’). Yet this global expansion of study 
locations has not been accompanied by an 
equivalent increase in the racial diversity of 
people enrolled. In 1997, 92% of the partici- 
pants in these trials were white; in 2014, we 


Sc: the late 1990s, the number of 


found that this figure was still nearly 86%. 
A growing body of literature indicates that 
the effectiveness of a drug, the likelihood of 
it causing side effects and the nature of those 
effects can all vary between people of dif- 
ferent ancestry’. And funders and research- 
ers have repeatedly said that clinical trials 
should include more participants from eth- 
nic minorities. Indeed, 25 years ago the US 
National Institutes of Health Revitalization 


Act called for more people from ethnic 
minorities to be included in clinical trials. 

In our view, drug developers should 
capitalize on the global expansion of clinical- 
trial locations to design studies that repres- 
ent more of the world’s population. 


PROBING THE DATA 
To understand which populations provide 
the drug safety and efficacy information 
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>» used by the FDA, we reviewed the 
approvals made by the agency at five time 
points from 1997 to 2014. For each approval, 
we documented the reported race of the 
people involved in the clinical trial and the 
country where the trial was conducted. (All 
were pivotal efficacy trials, generally the 
most definitive demonstrations of a medi- 
cine’s efficacy and safety.) We focused only 
on the categories ‘white, ‘black; ‘Asian’ and 
‘other’, because the reporting for these was 
uniform across countries. The data for eth- 
nicities, such as Hispanic or non-Hispanic 
people, were confounded by reporting 
inconsistencies and were therefore omitted 
from the analysis. 

We focused on treatments for heart 
disease, cancer and disorders of the central 
nervous system (CNS); 41% of the drug 
approvals made over the period 1997 to 
2014 were for these common global health 
problems’. We assessed FDA approvals 
because many countries follow the agency's 
lead when it comes to their own regulatory 
decisions. 

During the five years we assessed (1997, 
2004, 2009, 2012 and 2014), 81 drugs for 
heart disease, CNS disease and cancer 
won FDA approval (see Supplementary 
information) on the basis of clinical trials 
involving nearly 150,000 people. Twenty- 
nine countries contributed clinical-trial 
data in every year evaluated. For eastern 
Europe, Asia-Pacific, Latin America and the 
Caribbean, representation increased over 
the study period (see ‘Going global’). We 
classified regions according to the United 
Nations Regional Groups. 

Over the five time points, the racial make- 
up of the clinical trials stayed relatively 
stable (see ‘Going global’). The median 
percentage of African and African Ameri- 
can participants per trial ranged from 1.8% 
to 3.5%. For Asian participants, the range 
was 0% to 7%; for any group unspecified 
or not described as white, black or Asian, 
it was 1.4% to 3.4%. For context, according 
to the US census, 72.7% of the US popula- 
tion was non-Hispanic white in 1997 and by 
2014, this figure was 62.2%. Also, as of 2015, 
around 75% of the global population lived 
in Asia or Africa*. Of course, our analysis 
has limitations. The ideal mix of race and 
ethnicity will vary from country to country; 
we compared the trial demographics to the 
US population because we used trial data 
evaluated by the FDA. 


SHIFTING LOCATIONS 

There are various possible reasons why the 
diversity of trial participants is not increas- 
ing with the expansion of countries con- 
tributing to trial data. It is likely that fewer 
people are enrolling in the trials or fewer tri- 
als are being conducted in those countries 
that have begun contributing data more 
recently. 
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GOING GLOBAL 


The number of countries contributing to the clinical-trial data that the US Food and Drug 
Administration uses to approve drugs almost doubled between 1997 and 2014, from 32 to 57. 


The racial make-up of the trials was little changed. 


Countries contributing to clinical-trial data 


Enrolled participants in both years* 


Ethnicity of participants in clinical trials 


0.25% Asian 
[" 4% Other 


3% Black 3% Black 


92% 
White 


36,687 


participants 


1997 


None of the conversations we have had 
with drug developers suggests that the goal 
of including greater population diversity in 
clinical trials is driving the change in where 
trials are being conducted. 

We think that the sharp increase in the 
number of countries contributing to clinical- 
trial data (from 32 in 1997 to 57 in 2014) has 
been driven by two main factors. 

The first is a growing shortage of people 
from developed economies who are eligible 
for clinical trials. Wealthy countries are 
better able to adopt new therapies into 
standard care than are poor countries. 
So many patients in the United States 
and western Europe now have access to a 
variety of medications outside a clinical- 
trial setting. The scarcity of people with a 
particular condition who are not already 
taking medication — ‘treatment naive’ 
patients — has prompted drug developers 
to start recruiting overseas. 


Enrolled participants in 2014 


*lreland didn't enrol 
patients in 2014. 


No enrolment 


Asia has seen the most 
significant increase in 
numbers of participants 

' enrolling in trials over 
this time period. 


Despite the fraction 
of Asian participants 


6% Asian increasing, most 


- 5% Other 


participants are still 
white. 


86% 
White 


33,741 


participants 


2014 


The second factor is lower costs. In North 
America and western Europe, hospitals and 
other care centres tend to charge drug devel- 
opers much more for hosting clinical trials 
than do equivalent institutions in eastern 
Europe and Asia. Staff expenses also tend 
to be lower in less-wealthy countries and 
studies can be completed more quickly. 


DRUG EFFICACY 
Many studies indicate that the likelihood, 
nature and severity of side effects from a 
medication can differ between populations’. 
For example, the antiplatelet drug 
clopidogrel reduces a person’s likelihood of 
having a heart attack or a stroke after some 
heart procedures. Genome-wide-association 
and other studies have revealed that people 
with certain genetic variants of the CYP2C19 
gene, which encodes an enzyme that acti- 
vates the drug, might need a different and 
more expensive therapy’. And people of 
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Asian heritage are three times more likely 
to produce poorly functioning CYP2C19 
enzymes than are people of white heritage”*. 

Similarly, around 20 years ago, genomic 
and other studies, predominantly con- 
ducted in white populations, identified 
genetic markers that indicated whether 
people with cancer would have dangerously 
low counts of white blood cells after being 
treated with the autoimmune medicines 
6-mercaptopurine (6-MP) or azathioprine’. 
The markers were variants in a gene called 
TPMT. Studies subsequently conducted 
in Asian and South American patients 
receiving these drugs identified variants in 
NUDT15as the key predictor of whether the 
drug is toxic to a person's white blood cells*”. 
An important patient-safety marker had 
been missed for nearly two decades owing 
to a lack of comprehensive genomic testing. 


MOVING FORWARD 

Various efforts are under way to raise 
awareness about the importance of 
population diversity in clinical trials, and 
to improve the quality of the data that 
are collected on race and ethnicity (see 
‘Documenting diversity’). Too much of 
what we know about population differ- 
ences in drug responses is anecdotal or 
stems from observations of a handful of 
patients. Most clinical trials do not enrol 
enough patients from diverse populations 
to provide definitive guidance. 

It is obviously not practical to try to 
conduct drug trials in every recognized 
patient population in the world. But we think 
that developers are missing an opportunity 
to capitalize on the increase in the number of 
countries hosting clinical trials. Researchers 
should be designing studies — and choosing 
study sites — to make clinical trials more 
informative about the safety and efficacy of 
drugs for as much of the global population 
as possible. 

This could mean conducting clinical 
trials in particular places. Or it could mean 
increasing the numbers of patients recruited 
to certain trials. Another option could be to 
conduct ‘ethnobridging’ studies. Here a drug 
that has been assessed in a major clinical trial 
is then tested on a smaller number of people 
from a population of interest to gain insights 
about side effects, appropriate dosing levels 
and so on for that population. Japan uses this 
approach a lot: medications deemed safe and 
effective from global trials are subsequently 
tested on Japanese patients”. 

Regulatory authorities could do more. 
Mechanisms for homing in on ‘special pop- 
ulations’ in drug development are already 
in place. For instance, the FDA frequently 
requires developers to conduct a further 
trial focused on a certain age group ifa large 
proportion of those likely to use the drug 
in question will be, for example, older than 
75. The agency and other regulators could 


DOCUMENTING DIVERSITY 


Regulatory efforts 


Various international endeavours have 
attempted to provide guidance on how 
to consider and document population 

diversity in clinical trials. 

The International Conference on 
Harmonisation (ICH) has produced reports 
on this issue. The ICH brings together the 
regulatory authorities of Europe, Japan 
and the United States with experts from 
the pharmaceutical industry to create 
standards for the conduct of clinical trials. 
The ICH E5 report sets out how to evaluate 
the impact of ethnicity on a medicine’s 
efficacy and safety. The ICH E17 (released 
in 2017) offers guidance on planning and 
designing clinical trials conducted across 
multiple locations". 

Similarly, in 2012, it became a 
requirement in US law that the Food and 
Drug Administration report on the extent 
to which demographic subgroups are 


apply similar mechanisms to race or ethnic- 
ity. Regulators could also provide guidance 
for manufacturers on how genetic-ancestry 
information could be used to expand the 
assurances for clinicians and patients around 
drug safety. 

Ensuring that more populations are 
represented in clinical trials might also 
require that developers, regulators and 
others address some of the social barriers 
to enrolment. 

Within any one country, some popula- 
tions might, for cultural or historical rea- 
sons, prefer not to engage in a scientific 

study. Or they 


“Most clinical might have lim- 
trials do not ited access to the 
enrol enough medical centres 
patients where clinical tri- 
from diverse als are being con- 
populations to ducted. Currently, 
provide definitive °™e investigators 
guidance.” are working to 


address this, for 
instance by trying 
to build trust with communities on their 
turf. As far as we know, no such efforts are 
being pursued at scale by regulators or by 
pharmaceutical companies. 

The ideal would be to match treatments 
to individual patients on the basis of their 
genetic, proteomic or other profiles. In 
theory, that would remove the need to even 
consider race or ethnicity. With this goal 
in mind, identifying the individual differ- 
ences that influence the risks and benefits 
of a medication should be a component of 
clinical trials. 

For now, however, investigators must at 


represented in clinical studies for new 
drugs, medical devices and biologics such 
as monoclonal antibodies. The agency has 
produced reports summarizing these data, 
including the analysis Global Participation 
in Clinical Trials using data from 2015 

to 2016 — an evaluation that is largely 
consistent with our findings. 

Building on these reports, the FDA 
updated its guidance on how to collect 
race and ethnicity data in clinical trials. 
And in 2014, it began releasing Drug 
Trials Snapshots, public information on 
participants in the clinical trials that have 
supported the agency’s approval of new 
drugs (including people’s sex, race, age and 
SO On). 

These are steps in the right direction. 
But they have not yet driven a sufficient 
increase in the diversity of populations 
represented in clinical trials. T.0.K. @H.L.M. 


least exploit the fact that clinical trials are 
being done in many more countries than 
they were two decades ago — and strive to 
obtain a more complete picture of disease 
and how to treat it, to the benefit of all. m 
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You are not a 


Twenty years on, Dave Reay speaks out about the depression 
that almost sunk his PhD, and the lifelines that saved him. 


black dog. That’s how Winston 

Churchill described depression. 

A dark companion that lies quiet 
in the good times and is your master in 
the bad. Its arrival is hard to predict, just 
as its departure seems impossible until it 
has happened. Depression is as varied as 
the people who experience it, as shifting in 
form as a murmuration. Yet its flavour is 
unmistakable. 

It is twenty years since my own black dog 
stalked away. Twenty years since debilitating 
depression saturated my every thought and 
act. Survivors are forever wary that the dog 
will return. That it will overcome the new 
defences, leap the redoubts. It’s always out 
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there somewhere, skulking; reminding us to 
watch our step. 

Mental illness affects around a quarter 
of us at some point in our lives', with more 
than 300 million people battling depression 
worldwide’ and academics at greater risk 
than many’. The pressure to publish, make 
an impact, win funding, chase tenure, engage 
the public, shine on social media and influ- 
ence policy, combined with failing experi- 
ments, lone working and rigid hierarchies, 
are all threats to our well-being. Yet our job is 
a thing of beauty, too. To explore new ideas, 
to grapple with the unknown, and ultimately 
to understand: that is a privilege. 

As scientists, we talk, alot. To our students 


lone 


and peers, to policymakers and funders. We 
become experts in telling the world about 
our work and opinions, but we risk not truly 
hearing ourselves or those around us. 

My own two decades of silence were borne 
of fear. Fear of ridicule and mistrust, of overt 
hostility and covert isolation. Talking to 
friends, sharing memories of the dark times, 
has tempered these fears. Although speaking 
openly about my illness still makes me anx- 
ious, it is nothing to the waking nightmare 
lived by my student self. 

IfI could speak to him now, Id tell him 
not to be afraid, to trust in people. Id tell 
him how wonderful the warmth of those he 
spoke to will feel, how sharing the pain of his 
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mental illness would diminish its effects, not 
him. I would hug him and whisper: “You are 
not alone.” 

I tell this story for anyone who is silenced 

by the stigma of depression. I tell it to thank 
the wonderful people who gave me a voice, 
and then listened. I tell this story because I 
can, because I survived. 
Early one year in the mid-1990s my under- 
graduate road was nearing its end. Three 
years of marine biology, substance abuse, 
mild poverty and falling in love. By October, 
I was a doctoral student 500 kilometres away 
from family and the love of my life. 

It was natural to be scared. New people, 
new place, unknown rules and expectations. 
On the tall stool at my carefully demarcated 
square of lab bench I sat each day, trying to 
fathom the instructions of my supervisor. 
The research was on impacts of warming in 
the Southern Ocean — growing algae in the 
lab for starters, maybe a trip south later on. It 
was a masquerade from the start. The tech- 
niques and equipment were alien, my nod- 
ding response to instruction, an act. I tried to 
mimic the easy confidence of the other stu- 
dents, to fit in. My supervisor and his group 
must have assumed I had some idea of what 
to do. I hadn't. Nota clue. 

Batch after batch of my growing medium 
went bad. Each morning, returning to the 
lab from my bedsit, the large glass bottles 
would have the unmistakable milkiness of 
contamination. Hours of weighing, mixing 
and sterilization would follow as I flushed 
with embarrassment and tried to suppress a 
mounting fear. Fear of being reprimanded. 
Fear that I was just too stupid. Fear that 
everyone would find out. 

The days merged into weeks; the failures 
continued. My supervisor called me to his 
office. He was unimpressed. I needed to 
do better, work harder, get it right. Walk- 
ing back to my bench that day, a black dog 
walked with me. When exactly it had arrived 
I can't say. I was 21. My ramshackle mental 
defences had been crumbling for a while. 
Ill never know what tore the breach. It may 
have been an admonishment from a lab 
technician or just one more cloudy culture 
bottle. Whatever had splintered the final 
defence, depression was my new master. 

It is amazing how we can maintain a 
facade of normality while behind it a mael- 
strom of disintegrating sanity roars. By 
Christmas, I was thinner and quieter, but 
still me. Time with family and fiancée kept 
the black dog subdued. It waited. 

Instead of seeking help, I counted down 
the remaining days of holiday like a con- 
demned man. Sometimes you can see deep 
depression coming for you, slamming doors 
of escape, tightening the orbits of despera- 
tion. Deciding I would kill myself was a 
relief. I wrongly felt that it was the only door 
left open. One place the black dog could not 
follow. 


Saying goodbye to my family when I 
returned to university that January was 
brutal. My mask could not withstand the 
warmth of those people I so loved. Through 
the sobbing, fear drove me on, piloting my 
body through the long journey back to my 
term-time lodgings. 

Alone in the house I ran a hot bath, drank 
whisky without tasting it and selected the 
sharpest kitchen knife. I sat in that bath for 
an age. The knife hurt on my flesh, whisky 
or no. I wept and retched and wept some 
more. Too afraid to cut deep, anxious about 
what people would 


think, about the “Eyenthrough 
pain. I drained the darkest 
the bath and went days, gripping 
to bed. my lab bench 
The months  t9 suppress the 
that followed convulsions 
a acetone within, Inever 
normality and hated the 
science. 


despair. Depress- 
ion is exhausting. 
Even eating and washing felt irrelevant to my 
own contracting existence. Some days were 
doable, others I spent lying balled and wide- 
eyed, waiting for the night. When it arrived, 
sleep delivered only confused marathons of 
lost time. 

My second attempt at suicide came just 
before Easter and was longer in the planning, 
the means gleaned from the Dick Francis 
mystery Comeback and obtained from the 
lab’s chemical store. An injection, some nau- 
sea, a short blackout, that’s all. 

Throughout this period I self-medicated 
with alcohol, and cannabis when it was avail- 
able; anything for oblivion. I didn't seek help, 
therapy, or prescription drugs, nor did I quit 
my PhD. I survived long enough for help to 
find me, eventually. 

By the summer, life settled. Research, 
relationships, even my supervisor: all the 
quakes that had brought down the walls of 
my mind were still. I laid plans for work on 
an icebreaker in the South Atlantic that com- 
ing winter. It was exciting, an opportunity 
Ihad dreamed of, and yet still one in which 
I pictured my own destruction: deep inside, 
thoughts of suicide lingered, a secret obses- 
sion. A vision of slipping unseen into a remote 
sea, all a terrible accident, a tragic loss. 

It was people who saved me. They still 
do. A warm word from a friend, a joke and 
a moan with a colleague. Day by day, piece 
by piece, the people around me unknow- 
ingly brought my mind back to health. One 
man in particular — a beautiful human 
and can-do lab technician named Paul 
Beckwith — helped me to trust in others, to 
share and to belong. 

In December, just over a year after the 
start of my PhD, I joined the research ship 
near the Falkland Islands and my convales- 
cence accelerated. I sipped daily on a tonic of 


big-hearted, inspirational and funny scien- 
tists. Being stuck together in floating isola- 
tion for months could have been torment. 
Instead it was my salvation. 

The cloudy jars and experimental 
despair were replaced by thriving baths 
of algae, and geeky thrills as the microbes 
responded to artificially induced future 
climates. Even through the darkest days, 
gripping my lab bench to suppress the con- 
vulsions within, I never hated the science. 
Now that I could finally understand how 
to do it, I loved it. 

The savage beauty of the South Atlantic, 
its sparkling wildlife and scientists, will 
always be a touchstone for me. A month 
after joining the ship and at the end ofa long 
night shift, I stood alone on its rolling deck. 
Dawn broke, and tears streamed down my 
face. Tears of joy and release. The black dog 
was gone. 

The two decades since have sewn their 
own patchwork of light and dark. Deaths 
of those close to me, including the beautiful 
Paul, have called my black dog close again at 
times. But, for me, it is kept at bay by lucky 
brain chemistry, and the warmth and under- 
standing of my wife (that love of my life), and 
of our children and friends. Not everyone is 
so fortunate. 

As the supervisor of dozens of students 
and staff, empathy is useful, listening is 
vital. Just a chat, making time for people; 
it can mean the world. We are mentors, not 
therapists, but our university commun- 
ities are under enormous stress. Insecure 
jobs and mounting debt, endless metrics 
and poor management — all are risks to 
our mental health in the edifice of corpor- 
atization that our higher-education system 
has become. 

Awareness is growing, with supervisor 
training, peer-support networks and coun- 
selling services now commonplace. More 
importantly, the stigma of mental health 
issues is fading. Whether it’s to a friend or 
loved one, a therapist or colleague, those 
afflicted by mental illness must feel secure 
in speaking out. Silence was for too long the 
enemy of my own healing. Next time that 
black dog comes close, it will be met with 
aroar. @ 


Dave Reay is professor of carbon 
management and education, School of 
Geosciences, University of Edinburgh, UK. 
e-mail: david.reay@ed.ac.uk 
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George and Robert Stephenson’s Rocket won an 1829 competition to find the engine for the world’s first passenger railway. 


Titans of power 


Roger Fouquet critiques Richard Rhodes’s study of energy pioneers. 


ichard Rhodes is renowned for his 
R magisterial four-volume series on 

nuclear weaponry, starting with The 
Making of the Atomic Bomb (1987). Now, he 
sets his sights on an even broader landscape. 
Energy sweeps from the end of the sixteenth 
century to the golden era of energy before the 
1970s oil shocks, charting the challenges of 
engineering power, light, warmth and low- 
polluting technologies. Focusing on the West's 
rise to energy supremacy through mastery of 
technologies from steam to nuclear power, 
Rhodes explores the choices that led us to a 
present threatened by climate change. 

The bulk of his history centres on creative 
individuals, rather than economics or broader 
social and cultural implications of their work. 
As he shows, most of these engineers and 
inventors were not hermit geniuses. They 
often collaborated with peers, absorbed ideas 
and benefited (in ‘shoulders of giants’ mode) 
from earlier inventions. In re-enacting the 
build-up to pivotal events, Rhodes brings us 
into the eureka moments behind them. 

He captures both the limitations of tech- 
nologies and the improvements that inven- 
tors offered. In the 1760s, for instance, master 
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instrument-maker James Watt boosted the 
efficiency of steam engines by separating 
the working cylinder from the condenser. 
Rhodes also deftly portrays intellectual 
debates. We read the late-eighteenth-century 
correspondence of electrical experimentalists 
Luigi Galvani and Alessandro Volta, which 
revolved around giving shocks to frogs’ legs. 
And he relates the nineteenth-century battle 
of alternating and direct currents between 
Thomas Edison and George Westinghouse. 
Rhodes portrays innovators and entrepre- 
neurs as obsessively 
dedicated, often at the 
expense of health and 
prosperity. Engineer 
Richard Trevithick, 
inventor of the high- 
pressure steam engine, 
abandoned his family 
in 1816 to go to Peru, 
where his machines 
were deployed to 
dredge silver mines. 


Energy: AHuman 


Histor 
He made and lost for= ya RHODES 
tunes, and returned — simon & Schuster 
to see others (such (2018) 


as George Stephenson and his son Robert) 
develop the first steam railways. He died a 
pauper. Society, Rhodes shows, rarely repaid 
its debt to the era's technologists. 

Political intrigue figures large. Inventors 
squabbled over patents, or lobbied govern- 
ment through intermediaries to introduce 
laws protecting their interests. Thomas 
Savery’s experience is fascinating. The 
Enlightenment engineer’s connections 
enabled him to extend the patent on his 
early steam pump by 21 years, through the 
1699 Fire Engine Act. As a result, in 1712, 
Thomas Newcomen — who had designed 
a more powerful engine using atmospheric 
pressure — was forced to go into partnership 
with Savery. The engines could pump water 
out of deep mines, solving a major problem. 

Rhodes also stresses the harm of energy 
exploitation, such as the devastating toll 
wreaked on cetacean populations by the 
whaling industry, in pursuit of oil for lighting. 
He highlights disasters, from London's Great 
Smog of 1952, which killed thousands, to the 
ongoing threat of climate change. 

Rhodes’s discussions about environ- 
mental damage are welcome, and his tour 
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of bold innovators is well guided. But he 
fails to mention the women who helped to 
revolutionize the field — from discoverer of 
nuclear fission Lise Meitner to solar-power 
pioneer Maria Telkes. Nor does he devote 
much space to renewable energy. His history 
is uneven in other ways: the latter third of the 
book is a critique of the anti-nuclear lobby 
that, in my view, skews the overall message. 

Rhodes believes that protesters have sty- 
mied our transition to nuclear power, as has 
the adoption ofa “linear no-threshold” model 
of the biological effects of radiation, which 
proposes that even low levels are potentially 
harmful. Enough evidence is in place to 
question this model. But it is interesting how 
Rhodes repeatedly reminds us that the energy 
industry has a history of denying damage in 
the interest of profit, yet he seems to believe 
the nuclear industry immune to this practice. 

Rhodes argues for the commercial viability 
of nuclear technology. He calls the US project 
to build an early pressurized-water reactor in 
the 1950s “a godsend’; by 1954, he claims, “it 
was already competitive with non-nuclear 
power in Western Europe and Japan”. In my 
opinion, this ignores the full costs. No nuclear 
power station has ever won a competitive ten- 
der. Private companies still refuse to invest in 
the technology without massive government 
support. With a lack of significant invest- 
ment in Europe and the United States, today’s 
nuclear power industry is focused in China, 
where subsidy is a given and cost is less of an 
issue. I feel that, instead of blaming protesters, 
Rhodes should acknowledge that the failure 
of nuclear power is down to the fact that the 
economics haven't worked. 

From this perspective, the book’s prime 
conclusion is highly optimistic. Rhodes 
asserts that humanity will be able to pro- 
duce power on demand for hundreds of 
years more. He is concerned about climate 
change, but believes that low-carbon energy 
sources (including renewables and nuclear 
power) will solve the problem. This may well 
be true; perhaps the international experi- 
ment ITER and its successors will manage 
to provide cost-effective nuclear fusion. But 
innovators will need not only to create new 
kit. They will also need to spur new policy 
levers and behavioural patterns — and even 
social and political systems. 

Energy is packed with good stories but, 
ultimately, I see it as like a Thomas Savery 
engine: it generates a lot of heat, but loses 
steam. A scholar of Rhodes’s stature should 
have offered a deeper understanding of our 
struggle to improve our energy capacity — 
one of our most pressing global challenges. m 


Roger Fouquet is associate professorial 
research fellow at the Grantham Research 
Institute on Climate Change and the 
Environment at the London School of 
Economics and Political Science. 

e-mail: r,fouquet@lse.ac.uk 
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Books in brief 


Chasing New Horizons 

Alan Stern and David Grinspoon PICADOR (2018) 

It is the farthest-flung planetary mission yet. In July 2015, 

NASA spacecraft New Horizons zipped past the Pluto system, 

5 billion kilometres from Earth, at a cool 52,000 kilometres an hour. 
The stunning portraits it snapped range from Pluto’s heart-shaped 
Tombaugh Regio and towering spires of methane ice to its zoo of 
moons. Here, mission leader Alan Stern and astrobiologist David 
Grinspoon trace decades of painstaking preparation, and look to the 
craft’s future in the Kuiper belt and beyond — possibly heading into 
interstellar space. An exhilarating trek into the “wild black yonder”. 


— — The Book of Why 
in PEARL Judea Pearl and Dana Mackenzie BASIC (2018) 
= “Correlation is not causation.” That scientific refrain has had 
| BOo social consequences — not least, the long debate over links 
K OF between tobacco and lung cancer. Judea Pearl proposes a radical 
| WwW Hy mathematical solution. A prominent researcher in artificial intelligence 


| (Al), Pearl teams up with writer Dana Mackenzie to explore the science 
| RE ® . . rie) . soe . . 
= aveeSStNee | of causation, now bearing fruit in biology, medicine, social science and 
Erp, f ‘i . . ” = . 
“a | Al through tools such as the “causal inference engine”. The inevitably 


y chewy narrative is leavened by the contributions of half-forgotten 
researchers such as psychologist Barbara Burks. 
— Minds Make Societies 
_ Pascal Boyer YALE UNIVERSITY PRESS (2018) 
Minds Why do people believe absurdities, tolerate demagogues and 


cooperate (or not)? To decipher the complexities of human society, 

evolutionary psychologist Pascal Boyer draws from genetics, 

economics and biology. His incisive investigation of “six problems 

in search of a new science”, from group conflict to information, thus 
integrates an impressive range of research and attendant insights. He 
reveals, for instance, how ethnic strife is fed by a belief that national 
welfare is a zero-sum game, and why splashy wedding ceremonies 
cement pair-bonding. An elegant voice in a clamorous field. 


Milk! 

Mark Kurlansky BLOOMSBURY (2018) 

Best-selling author Mark Kurlansky follows up Cod (1997) and 
Salt (2002) with another zestful exploration of one foodstuff — 
milk — through history and a range of lenses. A human staple for 
10,000 years, milk was the first food to be investigated in a modern 
lab. It is now the most globally regulated. Kurlansky keeps up a 
cracking pace on a tour that covers classical geographer Strabo 
griping about the Celts’ milk consumption; the disease-generating 
dairies of nineteenth-century New York City; lactose intolerance in 
China; and 126 recipes for everything from ghee to syllabub. 


Atlas of a Lost World 

Craig Childs PANTHEON (2018) 

What was life like for the early humans who poured over the Bering 
land bridge to North America more than 20,000 years ago? On 
evidence, beyond brutal terrain and climate shifts, it involved 
harrowing games of hide-and-seek with colossal fauna such as 
the short-faced bear Arctodus simus. In this evocative scientific 
travelogue, Craig Childs enters that time and space, braiding 
together his treks along prehistoric pathways from Alaska to the 
south with palaeontological findings from coprolites to mastodon 
bones, and meetings with remarkable scientists. Barbara Kiser 
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Richard Feynman shared the 1965 Nobel Prize in Physics. 


THEORETICAL PHYSICS 


Feynman at 100 


Paul Halpern celebrates the oeuvre of a brilliant, original scientist on his centenary. 


pre-eminent twentieth-century 
Ars and a Nobel laureate: 
Richard Feynman was certainly 

those. He was also much more. As the 
centenary of his birth rolls around on 
11 May, a look at his scientific and cultural 
legacy recalls his restless multidimensional- 
ity. His popular books shattered readers’ pre- 
conceptions of scientists as lab-coated nerds 
and replaced them with a hipper image of 
a wild non-conformist; his scholarly tomes 
introduced researchers to revolutionary 
methods of grappling with modern physics. 
Feynman was a master conjuror of physics. 

A mathematical whizz with exceptional 
intuition, he seemed to pull solutions out 
of thin air. He crafted a lexicon for particle 
interactions: iconic squiggles, loops and 
lines now known as Feynman diagrams 
(see D. Cressey Nature 489, 207; 2012). His 
Nobel-prizewinning work on quantum 
electrodynamics included methods that even 
he saw as a sleight-of-hand for removing 
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infinite terms from calculations. Yet, his 
results — equivalent to more systematic, 
rigorously expounded mathematical tech- 
niques independently proposed by co- 
laureates Julian Schwinger and Sin-Itiro 
Tomonaga — matched atomic-physics data 
beautifully. 

Decades before his 1965 Nobel Prize, 
Feynman was already a legend of the 
Manhattan Project at Los Alamos, New 
Mexico, where he helped develop the 
atomic bomb. His colleagues and super- 
visors, including scientific director J. Robert 
Oppenheimer and head of the theoretical 
division Hans Bethe, were stunned by his 
computational abilities. 

Then there was the playfulness. His 
pranks, including safe-cracking and sneak- 
ing through security fences, and his passion 
for playing the bongos, were arguably as 
memorable as his science. Feynman loved 
telling stories about himself and observing 
the reaction — the more stunned, amused 


or horrified the better. In the 1980s, his 
friend and fellow drummer Ralph Leighton 
collected some of them in two bestselling 
volumes: Surely You're Joking, Mr. Feynman! 
(1985) and What Do You Care What Other 
People Think? (1988). 


PLAYFUL SPIRIT 

In the first, Feynman flaunted his rough 
edges and eccentricities, and much of the 
book is hilarious. (One story sees him 
bungling a sprinkler experiment in the 
cyclotron laboratory at Princeton University 
in New Jersey, shattering glass tubes and 
flooding the space with water.) The chroni- 
cles of Feynman’s Los Alamos days are 
exceptionally funny, such as his removal 
of the secret contents of physicist Edward 
Teller’s locked desk drawer after Teller told 
him it was impenetrable. However, the book 
shows its age in disturbingly sexist sections 
such as “You just ask them?’, about his pred- 
atory behaviour towards women. 
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CALTECH ARCHIVES, 


His relationships with women were com- 
plicated. In What Do You Care, he explained 
how he encouraged his younger sister Joan, 
now an acclaimed astrophysicist, to go into 
science. And he recounted how many of his 
attitudes had been shaped by his love for his 
first wife, Arline, who died of tuberculosis in 
1945, a few years after they married. Those 
stories were mostly written during the final 
stages of Feynman's life, after he had under- 
gone treatment for cancer. By that point he 
had undoubtedly become more cognizant of 
his legacy. 

Indeed, beneath his clown’s guise, 
Feynman was a sensitive man, suffering 
from both early grief and considerable 
anguish about the atomic weapons he had 
helped engender. These demons stymied his 
research from the end of the Second World 
War until 1947, when findings reported at 
the Shelter Island Conference in New York 
helped to spark his Nobel-prizewinning 
work. At the meeting, physicist Willis 
Lamb presented evidence of a discrepancy 
between predictions for certain energy levels 
of the hydrogen atom, calculated using the 
Dirac equation, and experimental results 
obtained using microwaves to excite the 
atom. Inspired by Bethe’s quick calculation 
of this Lamb shift, Feynman developed tech- 
niques to solve that problem and beyond, to 
the widest range of quantum electrodynamic 
interactions between charged particles. 


GOLDEN YEARS 
The next two years proved extremely 
productive in disseminating his extraordi- 
nary methods for calculating interactions 
in particle physics. Feynman published 
seminal papers such as ‘Classical electro- 
dynamics in terms of direct interparticle 
action’ (J. A. Wheeler and R. P. Feynman 
Rev. Mod. Phys. 21, 425-433; 1949) and 
‘Space-time approach to quantum electro- 
dynamics’ (R. P. Feynman Phys. Rev. 76, 
769-789; 1949). These build on each other 
like revelations in a Sherlock Holmes story 
— with the last supplying a complete resolu- 
tion of how electrons interact using photons. 
Through his diagrams, which surprisingly 
depict positively charged positrons as elec- 
trons moving backward in time, Feynman 
portrayed the gamut of possible modes of 
interaction, each contributing to the total 
picture in a weighted tally called “sum over 
histories”. In his brilliant vision, quantum 
reality is a cloud-like smear of particles’ pos- 
sible paths, as if a commuter from Essex to 
London could travel the entire route by train, 
coach, car and bicycle simultaneously. 
Feynman published two excellent 
primers introducing these quantum 
methods. Quantum Mechanics and Path 
Integrals (1965; co-authored with Albert 
Hibbs) begins with one of his favourite 
apparatuses, the double-slit experiment, 
and quickly moves into path integration 
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Richard Feynman lecturing at California State University, Long Beach, in 1979. 


and other advanced methods. The second, 
QED (1985; based on a lecture series) is 
more accessible — explaining the same 
theory using diagrams and examples. 

Starting in the 1950s, Feynman ventured 
into many other areas of theoretical physics: 
superfluids, superconductivity, gravitation 
and the constituents of protons and neu- 
trons, which he called partons. Two later 
papers that made a huge impact were his 
model of the weak interaction, “Theory of the 
Fermi interaction 


(R. P. Feynman “Feynman’s 
and M.Gell-Mann books urge us 
Phys. Rev. 109, [0 explore the 
193; 1958),andhis world with 


open-minded 
inquisitiveness.’ 


proposal for quan- 
tum computation, 
‘Simulating phys- 
ics with computers’ (R. P. Feynman Int. J. 
Theor. Phys. 21, 467-488; 1982). In each, 
he tinkered with variations until he reached 
definitive conclusions. 

Feynman was also a renowned educator. 
Those lucky enough to have attended his 
lectures have the best sense of how his 
agile mind operated. He taught a first-year 
course at the California Institute of Tech- 
nology (Caltech) in Pasadena: ‘Physics X; 
in which students would ask him anything 
and he‘ think on his feet. Feynman loved 
to astound, and often refused to provide 
solutions, to spur students on intellectually. 
The careful notes of attendees have been 
published as books and articles, bolster- 
ing his reputation as a master lecturer. One 
such from Caltech was the three-volume 
The Feynman Lectures on Physics (1964). 


+ J 


Another, 1959’s The Theory of Fundamental 
Processes, is based on notes taken by Peter 
Carruthers and Michael Nauenberg, two 
students at Cornell University in Ithaca, 
New York, when Feynman was a visiting 
lecturer there in 1958. Nauenberg told 
me how, during the first lecture, Feynman 
walked in, glanced at the blackboard, wildly 
erased the equations on it and declared 
that they would all learn the whole of phys- 
ics from scratch. Within a few weeks, the 
course proceeded from elementary quan- 
tum mechanics to Feynman’s rules for par- 
ticle-physics calculations. The Character of 
Physical Law (1967), another of Feynman's 
works, emerged from lectures he delivered 
at Cornell six years later. 

Feynman’s books urge us to explore the 
world with open-minded inquisitiveness, 
as if encountering it for the first time. He 
worked from the idea that all of us could 
aspire to take the same mental leaps as him. 
But, of course, not every ambitious young 
magician can bea Harry Houdini. Whereas 
other educators might try to coddle those 
who couldn't keep up, Feynman never 
relented. The essence of his philosophy was 
to find something that you can do well, and 
put your heart and soul into it. If not physics, 
then another passion — bongos, perhaps. = 


Paul Halpern is a professor of physics at the 
University of the Sciences in Philadelphia, 
Pennsylvania. He is the author of 15 science 
books, most recently The Quantum 
Labyrinth and Einstein’s Dice and 
Schrédinger’s Cat. 
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Sequencing risk to 
antibody patents 


Patenting claims for genetic 
sequences of potential therapeutic 
interest could be called into 
question ifthey match those in 
natural DNA. Next-generation 
sequencing (NGS) combined 
with single-B-cell isolation 
technologies could reveal 
millions of new human antibody 
sequences (see S. Rajan et al. 
Commun. Biol. 1,5; 2018; 
B. Wang et al. Nature Biotechnol. 
36, 152-155; 2018). 

NGS data are currently 
deposited at the Sequence 
Read Archive (see go.nature. 
com/2hhafpd). Although proper 
sequence annotations and 
analysis pipelines are still crucially 
needed (for searching the NGS 
data archive, for example), 
these data stand to reveal a 
wealth of information about 
human antibody repertoires and 
immunogenetics. 

However, patent applications 
for huge numbers of such 
newly discovered antibodies 
could be doomed — given the 
2013 US Supreme Court ruling 
that naturally occurring DNA 
segments are non-patentable. 
Prabakaran Ponraj Intrexon 
Corporation, Germantown, 
Maryland, USA. 
pponraj@intrexon.com 
PP. declares competing financial 
interests: see go.nature. 
com/2kwzwxt 


Bitcoin mining built 
on shifting sands 


I disagree with Stuart Wimbush’s 
arguments on the wealth 
generated during cryptocurrency 
mining (Nature 555, 443; 2018). 
Attributing value to the work 
itself rather than to its results 
works in a theoretical economics 
framework, but risks failing to 
capture the complexity of what 
wealth actually means. 

Many would argue that wealth 
is defined by social stability and 
by access to food, education and 
health care, rather than just by 


market values (see, for example, 
R. Costanza Nature 556, 300-301; 
2018). Living in an intact 
environment is also a form of 
wealth. The presupposition that 
cryptocurrency mining generates 
wealth, purely by converting 

vast amounts of energy into the 
results of otherwise purposeless 
algorithms, is in my view 
misguided — akin to claiming 
that multiplying the values of 

all state currencies by ten would 
make the world population ten 
times wealthier. 

Wimbush’s assertion that 
US$275 billion in wealth was 
generated through Bitcoin and 
Ethereum currency mining 
throughout 2017 overlooks 
their continuing and substantial 
devaluation. Bitcoin dropped by 
49% against the US dollar in the 
first quarter of this year alone. 
‘Wealth is merely a fluctuating 
number, which can be wiped out 
ina matter of weeks. I perceive 
the environmental damage that 
remains (see S. Foteinis Nature 
554, 169; 2018) as a negative 
global-wealth coefficient. 
Maciej Kawecki Swiss Federal 
Laboratories for Materials Science 
and Technology, Diibendorf, 
Switzerland. 
maciej.kawecki@empa.ch 


Secure energy, food 
and water in north 


Permafrost thawing in the 
warming Arctic and sub-Arctic 
is likely to cause widespread land 
subsidence, damaging homes 
and infrastructure and disrupting 
energy, food and water supplies. 
It could be particularly hard for 
developed countries around the 
North Pole to ensure that remote 
communities have access to these 
essentials by 2030, in accordance 
with the United Nations 
sustainable development goals. 
One way to help would be 
for governments to encourage 
adoption of renewable energy in 
the circumpolar north. This could 
power water-treatment facilities 
and greenhouses, thereby 
enhancing water and food 
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security, creating jobs and making 
communities self-sufficient. 

Many in the far north of 
Alaska, Greenland and the 
Canadian and Russian Arctic 
are not connected to centralized 
grid systems (L. Mortensen et al. 
Polar Geogr. 40, 163-185; 2017). 
Distribution of renewable energy 
would therefore depend on off- 
grid solutions such as community 
electricity generators, or on 
setting up and reconfiguring local 
microgrids. 

As more financial and 
technological support is provided 
to build community-scale 
renewable-energy facilities, 
subsidies on fossil fuels in 
the circumpolar north could 
gradually be phased out. 

Alex C. Y. Yeung University of 
British Columbia, Vancouver, 
Canada. 
alex.yeung@alumni.ubc.ca 


Kenya and Ghana’s 
new science funding 


Most African countries still 
do not prioritize research and 
development, despite its potential 
for resolving problems such as 
the continent's food security 
and its heavy disease burden. 
The governments of Kenya 
and Ghana aim to change this 
by setting up national funding 
programmes to achieve high- 
quality research output. 

Kenya's government has 
committed 2% of the country’s 
gross domestic product (GDP) 
to research. Its national research 
fund offers competitive grants to 
researchers in academia and the 
private sector. These financed 389 
graduate-student fellowships in 
2016 (see go.nature.com/2ji6lso) 
and 158 multidisciplinary or 
multi-institute research proposals 
(see go.nature.com/2rdduxv). 

In addition, 20 infrastructure 
development grants were 
awarded — to create centres for 
research into stem cells, cancer, 
fishing and food safety, for 
example. 

Ghana's government also 
intends to start its own national 


research fund, with a view to 
raising the money it spends on 
research to 1% of GDP. Last 
year it proposed investing the 
equivalent of US$50 million, 
pending parliamentary approval 
(see go.nature.com/2w140tn). 
It also plans to set up a National 
Science, Technology and 
Innovation Fund that is accessible 
to researchers and all potential 
inventors and innovators (see 
go.nature.com/2w2jjgl). 

Proper management will 
allow these initiatives to serve 
as a model for progress in other 
African countries. Government 
officials, grant managers and 
scientists must ensure, for 
example, that funding is awarded 
on merit and released on time. 
Thomas K. Karikari The 
Sahlgrenska Academy at the 
University of Gothenburg, 
Mélndal, Sweden. 
Patrick Amoateng University of 
Ghana, Accra, Ghana. 
ohenekakari@gmail.com 


Fourier’s Egyptian 
transformation 


You mark the 250th anniversary 
of the birth of Joseph Fourier by 
exploring the past and present 
ramifications of his celebrated 
mathematical transform (Nature 
555, 413; 2018). Also noteworthy 
is his expertise in Egyptology. 

As you mention, Fourier 
joined Napoleon Bonaparte’s 
Egyptian expedition as an 
administrator in 1798. He 
quickly became an accomplished 
scholar of Egyptology. It is said 
that in 1802 he showed a copy 
of the Rosetta Stone inscription 
to the young Jean-Francois 
Champollion (1790-1832), 
declaring it impossible to 
decipher the hieroglyphs 
it bore. Twenty years later, 
Champollion announced his 
success in decoding some of 
these hieroglyphs (see also 
A. Robinson Nature 450, 
793-794; 2007). 

Andreas Otte Offenburg 
University, Germany. 
andreas.otte@hs-offenburg.de 
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Scissors become sensors 


Two bacterial Cas proteins cleave nucleic acids indiscriminately after binding to specific target sequences. This property 
has now been harnessed to create highly sensitive, portable diagnostic tools for detecting viruses at low cost. 


D. DEWRAN KOCAK & CHARLES A. GERSBACH 


very low numbers of circulating viral 

particles, making the infection difficult 
to detect. For example, during one outbreak 
involving Zika virus, infected individuals har- 
boured as little as one virus copy per microlitre 
of blood serum’. These small numbers pose a 
large problem for medical diagnostics. How- 
ever, researchers might in the future be armed 
with molecular tools whose sensitivity has 
been refined by evolution. Writing in Science, 
three groups” * report a new class of diagnos- 
tic tool, created by repurposing nucleic-acid 
sensing systems found in bacteria. 

CRISPR-Cas systems are adaptive immune 
systems that protect bacteria from viral 
attack by functioning as nucleic-acid sensors. 
Nucleic-acid sequences from viruses that have 
previously invaded a bacterium are stored in 
the cell’s genome, and are used by Cas enzymes 
as a guide with which to recognize and bind to 
the same sequence if the virus invades again. 
Incoming viral nucleic acids are then cleaved 
through sequence-specific interactions. 

One particular CRISPR system, derived 
from the bacterium Streptococcus pyogenes, has 
been adapted for use as a genome-editing tool 
that is revolutionizing biomedical research. 
However, there are many other types of 
CRISPR system, each of which has specific 
characteristics. The three current studies use 
CRISPR systems that have an unusual prop- 
erty: after engaging a target nucleic acid, they 
cleave other nearby nucleic acids indiscrimi- 
nately, a process called collateral activity. The 
studies use this activity as a signal amplifier, 
whereby the cutting of many arbitrary nucleic 
acids serves as a beacon for the detection of 
one specific molecule. 

In the first of the current papers, Chen et al.” 
examined a type V CRISPR system that uses 
Cas12a proteins to target viral DNA. The 
authors discovered that Cas12a has collateral 
activity, indiscriminately cleaving single- 
stranded (ss) DNA after binding to its spe- 
cific double-stranded (ds) DNA target. The 
researchers took advantage of this property to 
create a diagnostic tool for detecting viruses. 

In the authors’ diagnostic, Cas12a guide 
sequences are designed to match particular 
targets in the viral DNA. After binding a viral 


Pp eople infected with a virus often harbour 
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Figure 1 | Repurposing CRISPR-Cas to produce virus sensors. a, Chen et al.” have developed a virus 
sensor called DETECTR, which has three parts: a Cas12a enzyme; an RNA sequence designed to guide 
Cas12a to a matching DNA sequence in a virus of interest; and a single-stranded (ss) DNA construct 
tagged with ‘fluorophore’ and ‘quencher’ molecules. In the absence of the target virus sequence, Cas12a 
is inactive and emission produced by the fluorophore is captured by the quencher (yellow arrow), 
preventing fluorescence. When Cas12a recognizes its target, it cleaves the ssDNA construct, separating 
the quencher from the fluorophore to generate fluorescence. b, Gootenberg et al.* and Myhrvold et al.* 
developed tools based on a system called SHERLOCK (ref. 6) that is similar to DETECTR. SHERLOCK 
uses Cas13, which targets viral RNA and cleaves ssRNA constructs. Gootenberg et al. designed two 
ssRNA constructs, one cleaved by Cas13 and one by another Cas enzyme, Csm6. Csm6 is activated by 
Cas13 cleavage products, and serves to amplify the fluorescent signal and improve the reaction kinetics. 


target, Cas12a cleaves a ‘reporter construct 
— an ssDNA that is tagged at one end with 
a fluorophore molecule and at the other with 
a quencher molecule. Before cleavage, the 
fluorescence emitted by the fluorophore is 
captured by the quencher. However, after the 
ssDNA has been cleaved by Cas12a, fluores- 
cence can be readily detected (Fig. 1a). Chen 
and colleagues boosted this fluorescent signal 
using a process called recombinase polymer- 
ase amplification (RPA) to amplify the target 
viral DNA, generating many more copies of the 
target sequence than are present in the original 
sample. 

The group’s system, which they name 
DETECTR (DNA endonuclease targeted 
CRISPR trans reporter), can detect target 
DNA at extremely low concentrations — as 
little as one molecule per microlitre of sam- 
ple analysed. The authors also showed that 


DETECTR could discern different strains of 
human papillomavirus in crude DNA extracts 
from patient samples. 

In the second paper, Gootenberg et al.’ 
expanded on previous work” ’ to create a diag- 
nostic called SHERLOCKvz2 (specific high sen- 
sitivity enzymatic reporter unlocking version 
2). The original version of SHERLOCK works 
similarly to DETECTR, but uses Cas13, which 
binds to and indiscriminately cleaves RNA, 
rather than DNA*. SHERLOCK is not limited 
to RNA detection, however — a transcription 
step can be included in the RPA process to 
produce RNA from viral DNA, enabling DNA 
detection by proxy’. 

Gootenberg and colleagues improved 
SHERLOCK in several ways. First, they 
screened 17 Cas13 family members and thor- 
oughly characterized their activity. The authors 
discovered that a subset of Cas13 variants 
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target mutually exclusive sequences. By using 
four such Cas13 proteins and designing a spe- 
cific reporter for each, the authors created a 
SHERLOCKvz2 system that could detect up to 
four viruses at once. Furthermore, by scaling 
up the RPA step, the group detected viral DNA 
sequences at concentrations as low as two 
copies per millilitre of sample. 

The researchers also discovered that Cas13 
cleavage products can activate another Cas 
protein, Csm6. By including a second ssRNA 
construct that could be cleaved by activated 
Csm6 (Fig. 1b ), the group boosted the signal 
relative to background and improved the 
kinetics of the SHERLOCKy?2 reaction. Finally, 
the authors combined all of these advances 
into a simple assay in which a drop of sample 
is applied to a strip of paper that holds the 
diagnostic, and a colorimetric readout is given. 
This format requires no instruments, greatly 
increasing the ease with which the technol- 
ogy can be used by scientists and clinicians in 
regions of high need. 

The techniques developed by Chen et al. 
and Gootenberg et al. both require stringent 
purification steps to prevent viral degrada- 
tion by the body’s RNA-digesting enzymes 
during testing. In the third paper, Myhrvold 
et al.” paired the previously reported version 
of SHERLOCK with a method for inactivat- 
ing these enzymes in body fluids, allowing 


the authors to directly detect viruses in urine 
and saliva. Their approach can be used to dis- 
criminate between related viral species, which 
can be difficult to tell apart because they cause 
similar symptoms. 

Viral genomes can mutate rapidly, but 
Myhrvold and colleagues showed that they 
could use their system to discriminate 
between sequences that differed by a single 
nucleotide mutation. This enabled them to 
identify strains of Zika virus isolated from 
different geographic regions, as well as a Zika 
virus harbouring a mutation associated with a 
developmental condition called microcephaly. 
They showed that appropriate guide sequences 
can be designed in less than a week, and the 
whole protocol implemented in less than two 
hours, with minimal equipment and sample 
processing. 

The diagnostic tools described in the current 
papers are a major advance for the tracking and 
treatment of viral outbreaks. But they could 
also have diverse applications beyond viral 
detection, such as identifying tumour-specific 
cancer mutations for personalized medicine, 
and enhancing quality control in agriculture 
and biomanufacturing by detecting possible 
biological contaminants. Although their long- 
term shelf-life remains to be tested, the tools 
excel in all other ASSURED criteria set by the 
World Health Organization for point-of-care 


Models of Parkinson’s 
disease revisited 


Conventional models propose that activity levels in two neuronal pathways, 
which have opposing effects on movement, become imbalanced in Parkinson’s 
disease. Analyses in mice point to a more complex reality. SEE ARTICLE P.177 


THOMAS WICHMANN 


cientists have long associated structures 
in the forebrain called the basal ganglia 
with the control of movement, and with 
movement disorders such as Parkinson’s 
disease. Some of the strongest evidence for 
the role of the basal ganglia in movement is 
anatomical: the basal ganglia receive neu- 
ronal input from movement-related areas 
in the brain’s frontal cortex through a region 
called the striatum. They then send it back to 
the frontal cortex. Two pathways mediate this 
processing, but models to explain how process- 
ing functions go awry in movement disorders 
are debated. On page 177, Parker et al.? add 
valuable data to the discussion. 
In conventional models of basal-ganglia 
function, the two pathways that connect the 
striatum to output structures work in opposing 


ways. The first, dubbed the direct pathway, is 
activated by the neurotransmitter molecule 
dopamine, through D1-like dopamine recep- 
tors on direct-pathway spiny projection neu- 
rons (dSPNs). Activation of dSPNs is thought 
to facilitate movement’*”. The second, the 
indirect pathway, is inhibited by dopamine 
through D2-like receptors on indirect-pathway 
spiny projection neurons (iSPNs). In contrast 
to the first pathway, activation of iSPNs is 
thought to reduce movement'*”. 

Current models of basal-ganglia function 
emphasize that balanced activity in these two 
pathways is required for normal movement. 
For example, one model posits that sequen- 
tial activation of the pathways could ‘scale’ 
movements — activation of dSPNs would 
first facilitate movements, and subsequent 
activation of iSPNs would terminate them’. 
Alternatively, the action-selection hypothesis 
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devices®, and are ready for deployment. The 
impact and adoption of these technologies will 
be determined by factors such as regulatory 
approval, scaling up synthesis for mass pro- 
duction, distribution logistics and economics. 

Much of the excitement about CRISPR 
technology has centred on its use in gene or 
cell therapies, which are expensive processes 
that, for the foreseeable future, will probably 
be available only in prosperous regions of the 
world. The current studies greatly expand and 
diversify the possible beneficiaries of CRISPR 
technology, by developing a low-cost technique 
that has many important potential uses. m 


D. Dewran Kocak and Charles A. Gersbach 
are in the Department of Biomedical 
Engineering and the Center for Genomic and 
Computational Biology, Duke University, 
Durham, North Carolina 27708, USA. 

e-mail: charles.gersbach@duke.edu 
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proposes that interplay between dSPN and 
iSPN activation might assist the selection 
of particular actions by the frontal cortex, 
with initiation of certain movements associ- 
ated with dSPN activation, and movement 
prevention associated with iSPN activation®”. 

These models predict that movement 
disorders result from imbalanced pathway 
activities. For instance, it has convention- 
ally been argued’ that, in Parkinson's disease 
(which involves decreased striatal dopamine 
levels), the balance would shift in favour of 
iSPN activation, leading to slowness or lack of 
movements. By contrast, involuntary move- 
ment (dyskinesia), such as that associated 
with overuse of the anti-parkinsonian drug 
L-3,4-dihydroxyphenylalanine (L-DOPA), 
would arise from a shift towards dSPN 
activation (Fig. la). 

Parker et al. used a method called fluores- 
cence microendoscopy*”° to simultaneously 
monitor the activity and spatial arrange- 
ment of large groups of dSPNs and iSPNs in 
freely behaving mice. The two sets of neu- 
rons were genetically engineered to fluoresce 
when calcium entered the cell, indicating 
electrical signalling. The authors monitored 
three groups of mice: those under normal con- 
ditions, mice treated with a drug that causes 
the loss of dopamine-releasing neurons to 
mimic the parkinsonian state, and dopamine- 
depleted animals treated with L-DOPA to 
induce involuntary movements. 

The authors’ observations in normal animals 
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Figure 1 | Models of movement-related neuronal activation in the brain’s striatum. Activation of 
direct-pathway spiny projection neurons (dSPNs; active neurons depicted as blue circles, with increasing 
colour intensity indicating increasing levels of activity) facilitates movements, whereas activation of 
indirect-pathway spiny projection neurons (iSPNs; red circles) inhibits movements. Grey circles indicate 
inactive neurons of both subtypes. a, Conventional models predict that normal movement is associated 
with activation of some dSPNs and iSPNs. In disorders such as Parkinson's disease, in which movements 
decrease, dSPN activation is reduced and iSPN activation is increased. The opposite changes are found 
in dyskinesias, which involve involuntary movements. The conventional model does not predict specific 
spatial arrangements of activated neurons. b, By contrast, Parker et al.’ find that normal movement in 
mice involves activation of clusters (indicated in dashed lines) of dSPNs and iSPNs. In the parkinsonian 
state, SPN activation is reduced and iSPN activation is increased. Clusters of activated dSPNs are smaller 
than in the normal state, and active iSPNs are less clustered. The opposite occurs in dyskinesias. 


confirmed’ that similarly sized clusters of 
iSPNs and dSPNs are simultaneously activated 
during movement. These data contradict both 
the scaling model (under which dSPNs are 
expected to be active before iSPNs) and the 
action-selection model (which predicts that 
clusters of inhibitory iSPNs should be larger 
than those of facilitating dSPNs). Instead, 
they add to mounting evidence that these 
pathways might not regulate the basic execu- 
tion of movement, but might have a role in 
higher-order behavioural activities, such as 
the dynamic shaping of movements through 
reinforcement learning” or the control of 
movement ‘vigour’. 

When the authors studied dopamine- 
depleted parkinsonian animals, they found 
decreased dSPN activity and increased isPN 
activity, as predicted by the conventional mod- 
els. However, they also found declines over time 
in the movement-related activation of iSPNs 
(a phenomenon separate from their general 
increased activation level), and declines in the 
spatial clustering of their activation (Fig. 1b). 

As expected, drugs that activate D1-like 
receptors increased dSPN activity in parkin- 
sonian mice, whereas drugs that stimulated 
D2-like receptors reduced the elevated iSPN 
firing rates. -DOPA treatment, which leads 
to activation of both receptors, elicited both 
effects and was the only treatment to reverse 
the clustering deficit in iSPNs, perhaps 
explaining its superior clinical efficacy. Finally, 
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these beneficial effects became exaggerated in 
animals that developed involuntary move- 
ments induced by t-DOPA: iSPNs became 
underactive, and dSPNs became hyperactive 
and less clustered. 

Parker and colleagues’ study therefore 
supports many of the general conclusions of 
earlier work regarding activity changes in the 
normal, parkinsonian and dyskinetic states. 
However, it also shows that the conventional 
models are oversimplified, because they can- 
not account for the newly discovered changes 
in spatial clustering of active striatal neurons 
in disease states. 

This study is an excellent example of how far 
experimental analyses of circuits in the basal 
ganglia have come in the past few years. Where 
we could once record a single, unidentified 
striatal neuron, we can now monitor hundreds 
of genetically characterized neurons simulta- 
neously, albeit still at a low temporal resolution 
and at the expense of tissue damage caused by 
the implanted microendoscope. As Parker 
and colleagues demonstrate, these techniques 
can provide insights into disease mechanisms 
that were not possible with earlier methods. 
Beyond being of substantial academic inter- 
est, the insights afforded by the current study 
suggest that it might be worth developing 
therapeutic strategies for the treatment of 
Parkinson’s disease aimed at decorrelating 
hyperclustered striatal activity patterns. 
Similar approaches have been explored using 


pharmacogenetic techniques’° and deep brain 
stimulation’””* in other portions of the basal 
ganglia. 

As with any good study, this research leads 
to more questions than it answers, particularly 
with regard to the changes in spatial cluster- 
ing of active neurons seen by the authors. 
For instance, the mechanisms governing the 
opposing abnormalities in clustering in the 
parkinsonian and dyskinetic states are not 
understood, and it remains unclear whether 
they reflect changes in inputs to the striatum, 
or bona fide striatal phenomena. It is also not 
known whether similar clustering occurs in 
humans or other primates, or whether it is 
accompanied by altered clustering of neu- 
ron activity in other basal-ganglia structures. 
Only time will tell whether the gradual shifts 
in cluster sizes found in the parkinsonian and 
drug-treated animals are related to known 
adaptive processes, such as the loss of input- 
receiving structures called dendritic spines” 
on the surfaces of iSPNs and dSPNs’, as Parker 
et al. suggest. 

Finally, one of the most important questions 
is whether the observed changes in the spatial 
and temporal clustering of neuronal activi- 
ties in the basal ganglia are, in fact, causally 
related to aspects of Parkinson's disease or the 
development of dyskinesias. Regardless of 
the answer, Parker and colleagues’ study has 
added a new dimension to our understanding 
of the abnormalities seen in this disease. m 
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MICHAEL C. SINGER & CAMILLE PARMESAN 


Precarious 
preferences 


In 1993, three ecologists reported! that 
an isolated population of the butterfly 
Euphydryas editha that inhabited a 
meadow in Nevada was starting to evolve 
a preference for laying its eggs on Plantago 
lanceolata — a non-native plant introduced 
to the region by cattle ranchers. On page 
238, two of these researchers now show? 
that the butterflies became completely 
dependent on the exotic plant, with adverse 
consequences. 

Originally, E. editha laid its eggs on the 
native plant Collinsia parviflora, but the 
longer life span of Plantago enabled larvae to 


feed on it for longer, increasing larval survival. 


In the current study, Singer and Parmesan 
report that all the female butterflies they 
examined in 2005 preferred to lay their 
eggs on Plantago (pictured, E. editha resting 
on Plantago). And by 2007, all the larvae 
they found in the field were feeding on this 
non-native plant. 

But cattle ranching ceased in 2005, 
leading to a rapid build-up of grassy 
vegetation and a decline in the dominance 
of Plantago. This, in turn, led to the extinction 
of the isolated E. editha population between 
2007 and 2008, probably because the long 


PARTICLE PHYSICS 
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grasses blocked out sunlight, cooling the 
warmth-loving larvae. The growth of the 
grasses quickly abated, but the butterfly 
population remained extinct until between 
2013 and 2014, when E. editha recolonized 
the field, laying its eggs on Collinsia, and so 
setting the stage for the process to begin 
again. 

These findings demonstrate how 
the adaptation of insect populations to 


Weak charge of the 
proton measured 


The proton’s weak charge defines the strength of certain interactions between 
protons and other particles. A precise determination of this quantity provides a 
stringent test of the standard model of particle physics. SEE LETTER P.207 


XIAOCHAO ZHENG 


fundamental forces. However, only two of 

these forces have effects on macroscopic 
scales: gravity keeps us grounded on Earth, and 
electromagnetism causes lightning on stormy 
days. We are not directly influenced by the 
other two forces — the weak and strong forces. 
Similarly, it is generally known that mass is at 
the root of gravitational interactions and that 
electric charges and magnetic moments are 
central to electromagnetism. But the physical 


\ ubatomic particles interact through four 


properties that describe the strength of weak 
and strong interactions, known as weak and 
colour charges, respectively, are less familiar. 
On page 207, the Jefferson Lab Q,,.., Collabora- 
tion’ reports the first high-precision measure- 
ment of the weak charge of the proton, which 
sets tight constraints on physics that cannot be 
described by current theories. 

The strong force is so overwhelming that 
the particles that interact through it, known 
as quarks and gluons, are tightly bound to 
one other and exist only as composite objects, 
such as protons and neutrons. By contrast, the 


human-induced environmental change can 
render those populations dependent on the 
continuation of specific human practices 
—a potentially precarious position, given 
the rapidity with which such things can 
change. AnnaArmstrong 


1. Singer, M.C., Thomas, C. D. & Parmesan, C. Nature 
366, 681-683 (1993). 

2. Singer, M. C. & Parmesan, C. Nature 557, 
238-241 (2018). 


weak force is so feeble that its interactions are 
almost completely masked by those of electro- 
magnetism. One might therefore wonder how 
the weak charge of a particle can be measured 
if itis as small as the name implies. Fortunately, 
nature provides a convenient yardstick that is 
associated with a principle known as parity 
symmetry. 

A process conserves parity symmetry if it 
occurs with the same probability as its exact 
mirror image. It is straightforward to see that 
parity symmetry is broken in the macroscopic 
world, particularly in biological systems. For 
example, most humans are right-handed. 
If parity symmetry were conserved for the 
handedness of humans, half of the popula- 
tion would be right-handed and half would be 
left-handed. 

Particles also have a handedness. A right- 
handed particle spins in the direction defined 
by the curl of your four fingers when you 
point your right thumb along the direction 
of the particle's velocity. Conversely, a particle 
is left-handed if you must use your left hand 
to relate its spinning and velocity directions. 
Remarkably, all subatomic particles violate 
parity symmetry when they interact with one 
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another through the weak force. Weak charges 
can therefore be determined by comparing the 
behaviour of left- and right-handed versions 
of particles. 

To extract the proton’s weak charge, the Q\yeak 
Collaboration fired beams of electrons that had 
a particular handedness at a proton target. 
They measured an asymmetry that describes 
the difference in the probability that right- and 
left-handed electrons are scattered from the 
proton (Fig. 1). The authors found an asymme- 
try of -226.5 + 9.3 parts per billion, where the 
minus sign indicates that left-handed electrons 
are more likely to be scattered than their right- 
handed counterparts. To put the magnitude of 
this asymmetry in perspective: if parity sym- 
metry were violated for the height of moun- 
tains, Mount Everest and its mirror-image twin 
would differ in height by a mere 2 millimetres, 
and this difference would have been measured 
to a precision of + 80 um. 

The authors’ result has a much higher 
precision than all previous experiments that 
studied parity violation by scattering electrons 
from a nuclear target. The E158 experiment 
at the SLAC National Accelerator Laboratory 
in Menlo Park, California, had a comparable 
precision, but measured the weak charge of the 
electron rather than that of the proton’. The 
Qveak Collaboration used its measured asym- 
metry to determine that the proton’s weak 
charge is 0.0719 + 0.0045, which is in excellent 
agreement with the value predicted by the 
standard model of particle physics’. For com- 
parison, in the convention used by the authors, 
the proton’s electric charge is +1. 

One might question why physicists want to 
measure the proton’s weak charge to such high 
precision. The short answer: to test the limits 
of our knowledge. At a basic level, physicists 
seek to discover if, and at what length scale, 
current theories fail to explain observational 
data. Such a failure could imply the existence 
of a fifth fundamental force — a previously 
undiscovered type of interaction that has 
a role at energies higher than have been 
explored so far. 

The measurement reported by the Qyeax 
Collaboration shows that such interactions, 
if they exist, would reveal themselves at par- 
ticle energies beyond several TeV (1 TeV 
is 10” electronvolts). For comparison, the 
energies released in nuclear-fission reactors, 
in which nuclei split into two or more frag- 
ments, are at the level of 10°eV per particle. 
The authors’ lower limit for the energy scale 
of new physics is comparable to, and comple- 
ments, that set by experiments at the Large 
Hadron Collider at CERN, Europe's particle- 
physics laboratory near Geneva, Switzerland*”. 
This is remarkable, given that the energy of the 
authors’ electron beams is thousands of times 
lower than that of the Large Hadron Collider’s 
proton beams. 

More than a century ago it was demonstrated 
that electric charge comes in discrete chunks’, 
which provided a bridge between classical 
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Figure 1 | Parity violation in electron-proton 
scattering. Electrons can be either left- or right- 
handed, depending on the direction in which they 
spin (red arrows) relative to their motion (black 
arrows). The Jefferson Lab Q,,.., Collaboration! 
reports that left-handed electrons are slightly more 
likely to be scattered from a proton target than 

are their right-handed counterparts. This result 
violates a principle known as parity symmetry 
because the two scattering processes are mirror 
images of each other. The authors used the 
measured asymmetry to determine the value ofa 
fundamental physical quantity known as the weak 
charge of the proton to an unprecedented precision. 


electromagnetism and modern quantum 
mechanics. By measuring quantities — from 
physical constants to particle properties such 
as the proton’s weak charge — at increasingly 
higher precision, new interactions and parti- 
cles could be discovered that require current 
theories to be revised. Such lofty pursuits 
can be maintained only if research is carried 
out by multiple groups across generations, as 
opposed to single groups in isolated instances. 
One day, the knowledge acquired might lead to 
a breakthrough. In the meantime, it will make 
our world a better one by providing a deeper 
understanding of the physical laws of nature. m 
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Connections that 
control defence strategy 


When presented with a threat, most mice freeze or hide, but a few respond more 
aggressively. The brain circuits underlying this behavioural choice have now 


been unpicked. SEE ARTICLE P.183 


DAYU LIN 


y nine-year-old daughter got a pair of 
Me= for Christmas. To acclimatize 

her new pets to her presence, she put 
her hands into their cage. One of the mice ran 
straight to its shelter and hid; the other stayed 
ina corner, held its tail up against the glass wall 
and rattled it rapidly. Tail rattling is a com- 
mon response to stress across species (most 
famously in snakes’), and is considered a warn- 
ing to intruders’”. But why do some mice hide, 
and others threaten? What brain mechanisms 
determine which defensive strategy to deploy 
in the face of potential danger? On page 183, 
Salay et al.* reveal a brain region that could be 
responsible for this decision in mice. 

Salay et al. first surveyed the brain regions 
that are activated by threatening cues. They 
placed mice in a test arena, then presented 
them with an overhead looming stimulus — an 
expanding dark-coloured circle, to mimic an 
approaching predator. When the mice detected 
the looming cues, they either froze or quickly 


ran towards a shelter under which they could 
hide from the stimulus. Occasionally, mice 
rattled their tails once they had reached the 
safety of the shelter. 

To identify the neurons that determine 
these behavioural responses, the researchers 
analysed the protein c-Fos, which is expressed 
rapidly in neurons after they have been highly 
active. Many brain regions show consistently 
high levels of c-Fos in response to looming, 
and one caught the authors’ attention — the 
ventral midline thalamus (vMT). The vMT is 
interesting in that it is not a part of the eye-to- 
brain visual pathway or a motor pathway, as 
might be expected for neurons involved in pro- 
cessing visual stimuli such as the looming cues 
in this study. Instead, it receives diffuse inputs 
from limbic areas and the midbrain (regions 
that support emotion, motivational behaviours 
and bodily responses), and projects heavily to 
higher cognitive areas, such as the prefron- 
tal cortex*. Thus, this region is well suited to 
signalling the internal state of the animal and 
guiding its defence strategies. 
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Ceiling 
display 


Figure 1 | To recede or to challenge. Salay et al.’ report that a region of the brain known as the ventral 
midline thalamus (vMT) controls how a mouse responds to a threatening visual stimulus, such as an 
expanding dark circle in a ceiling display that looms overhead. a, The authors provide evidence that 
signalling from the vMT to the prefrontal cortex (PFC) promotes an active defence strategy, such as 
making an aggressive tail-rattling motion. b, By contrast, when levels of vMT activation are lower, mice 


respond to the looming stimulus by hiding. 


In vivo recordings of neuronal activity 
revealed that vMT cells are most active when 
the animals are responding actively to loom- 
ing stimuli by running and tail rattling. Salay 
et al. therefore asked whether vMT activity is 
sufficient to guide an animal's defence strategy. 
The authors injected the VMT with a viral con- 
struct carrying a receptor protein that, when 
activated, excites neurons. The receptor is acti- 
vated on binding to a small ligand, which the 
authors administered systemically weeks after 
injection of the virus. Strikingly, when Salay 
and colleagues artificially activated vMT cells, 
the mice no longer hid from the looming stim- 
uli, but instead rattled their tails frequently and 
ran around the open space. Conversely, when 
vMT cells were suppressed using a similar 
approach, the animals spent more time freez- 
ing and never rattled their tails in response to 
looming stimuli. 

To investigate how vMT activation affects 
responses to looming in more detail, the 
authors genetically modified cells such that 
they could be activated by a laser, giving milli- 
second-level control of cell activity. They found 
that activating vMT cells either in tandem 
with or 30 seconds before the presentation 
of looming stimuli promoted the same active 
defence behaviours, such as tail rattling. This 
behavioural shift could outlast the stimulation 
itself, which makes it likely that activating the 
vMT causes a shift in internal state, instead 
of eliciting acute motor actions. Consistent 
with this hypothesis, the researchers showed 
that vMT activation induces strong and long- 
lasting autonomic responses indicative of 


increased arousal, such as pupil dilation, but 
does not induce changes in motor activity in 
the absence of looming stimuli. 

Next, Salay et al. investigated which 
projections from the vMT are relevant for 
guiding defence behaviours. They found that 
distinct regions of the vMT project to the 
prefrontal cortex and basolateral amygdala, 
both of which are implicated in many com- 
plex functions, including fear and anxiety. 
The authors again used a viral technique, this 
time to manipulate specific VMT outputs, 
and found that activating VMT projections 
to the prefrontal cortex pathway promoted 
an active threat-coping strategy (tail rattling) 
and increased arousal (Fig. 1). By contrast, 
activating the vMT-basolateral amygdala 
projection favoured a different behaviour, 
freezing. Taken together, the authors conclude 
that distinct VMT outputs control opposing 
threat-coping strategies. 

Escaping and hiding are basic survival 
instincts when an animal is faced with a power- 
ful predator. However, from time to time, 
prey animals need to stand their ground. For 
instance, adult ground squirrels protect pups 
in a burrow fiercely — after detecting a nearby 
snake, the squirrel flags its tail and aggressively 
harasses the snake by biting and kicking dirt 
towards it. Asa result, the snake typically aban- 
dons its hunting effort and moves away”. When 
a weak animal challenges a stronger predator 
in this way, the animal might be considered 
‘brave. Does it stand to reason, then, that the 
vMT drives a ‘courageous state? 

Answers to this question remain unclear, 
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50 Years Ago 


Britain’s first heart transplant 
operation, performed last week 

at the National Heart Hospital, 

has been received with the 

usual gush of publicity which 
accompanies such events. But less 
acclaim has been given to the liver 
transplantation operation carried 
out at Addenbrooke's Hospital, 
Cambridge, on May 2, although this 
is technically by far the trickier of the 
two operations ... Several attempts 
at liver transplants have been made 
in the United States, but met with 
little success until June of last year. 
Since then advances in immuno- 
suppression and organ storage 

have enabled the livers of deceased 
persons to be maintained in a form 
acceptable to the recipient’s body for 
several hours ... The heart surgery 
team at the National Heart Hospital 
says that the heart transplant 
operation is the first ofa series of 
operations leading to acombined 
heart-lung transplant before the end 
of the year. 

From Nature 11 May 1968 


100 Years Ago 


The Gas Traction Committee, 
appointed in November last to 
consider the employment of 
coal-gas as a substitute for petrol 
and petroleum products in motors, 
its general safety, and conditions for 
use, has issued an interim report ... 
This deals chiefly with the present 
use of gas, mainly at low pressures, 
in suitable fabric bags; the work is to 
be continued to cover its use under 
higher compressions when the 
necessary appliances and plant are 
obtainable. The Committee is satisfied 
that gas can be efficiently, safely, and 
promptly substituted for motor spirit 
(only two minor accidents have been 
reported). Two hundred and fifty 
cubic feet are considered equivalent 
to one gallon of petrol, so that gas at 
4s. per 1000 cub. ft. is equivalent in 
cost to petrol at 1s. per gallon. 

From Nature 9 May 1918 
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because a direct assessment of courage is not 
possible in non-human species. Intriguingly, 
however, the internal state elicited by VMT 
activation seems to be perceived as positive, 
because the mice prefer to stay in a chamber in 
which they can receive the vMT stimulation, 
given a choice. Future studies that examine the 
correlation between vMT activity and courage 
in humans will help us to further understand 
the emotional state encoded by the vMT. 

It is also still unclear whether tail rattling 
in mice can effectively deter a predator in the 
wild. But this behaviour did successfully stop 
my daughter from bothering her pets. m 
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A closer look at 
human telomerase 


The telomerase enzyme performs a crucial role by maintaining protective caps 
of repetitive DNA sequences at the ends of chromosomes. A structure of human 
telomerase casts fresh light on its architecture. SEE ARTICLE P.190 


MICHAEL D. STONE 


he chromosomes of eukaryotic 

organisms, which include plants, 

animals and fungi, are capped by pro- 
tective structures called telomeres’. In certain 
types of proliferative cell, these structures 
are maintained by the telomerase enzyme. 
Inherited genetic mutations that compromise 
telomerase function cause disorders character- 
ized by deterioration of proliferative tissues’, 
whereas increased expression of telomerase 
supports unbridled cell growth in most human 
cancers’. Efforts to develop telomerase-based 
therapeutics have been hampered by an 
incomplete understanding of the structure 
and organization of the telomerase complex. 
On page 190, Nguyen et al.* report a structure 
of human telomerase bound to DNA, provid- 
ing an unprecedented view of how the enzyme 
complex is organized and establishing a frame- 
work for drug discovery. 

Telomerase is a ribonucleoprotein (RNP) 
enzyme — part RNA, part protein. To retain 
normal function, the enzyme must contain at 
least a telomerase reverse transcriptase (TERT) 
protein subunit and along, non-coding telom- 
erase RNA (TR). Telomerase complexes also 
usually include several species-specific proteins 
that are important for regulating RNP assem- 
bly’. The enzyme maintains telomeres by cata- 
lysing the addition of simple DNA-sequence 
repeats onto the ends of linear chromosomes. 
The catalytic TERT subunit uses a small por- 
tion of TR as a template for these repeats. 

Vertebrate telomerases also belong to the 
family of box H/ACA RNP complexes, which 
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generally modify essential cellular RNAs such 
as transfer RNAs, ribosomal RNA and splice- 
osomal RNA®. Curiously, despite possessing 
the RNA and protein components of H/ACA 
RNPs, telomerase does not exhibit the RNA- 
modifying activity typically associated with 
this class of complex. Nevertheless, muta- 
tions in those components impair telomere 
maintenance and cause diseases related to 
telomerase deficiency’. 

Human telomerase has low abundance in 
cells, and attempts to reconstitute it from its 
purified components have been thwarted by 
protein insolubility and the low efficiency of 
RNP assembly. This challenge can be over- 
come by expressing TERT and TR in cultured 
human cells that naturally express large quan- 
tities of the other telomerase proteins needed 
for RNP assembly’. This approach was used 
to reconstitute human telomerase and pro- 
duce the first low-resolution structure® of 
the enzyme using electron microscopy (EM). 
However, telomerase complexes made in this 
way can contain different subunits and have 
varying structures, both of which limit the 
resolution of EM structures. 

Nguyen et al. report an improved purifica- 
tion procedure that yields substantially more- 
active and more homogeneously assembled 
telomerase. Armed with this highly puri- 
fied enzyme, the authors used cryo-electron 
microscopy (cryoEM) to reconstruct the struc- 
ture of human telomerase at subnanometre 
resolution. This resolution is three times bet- 
ter than that of the previously reported struc- 
ture, and allowed the authors to reliably work 
out how the various telomerase components 
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correlate with the regions of electron 
density mapped out by cryoEM. 

The authors’ structure has a bilobal 
shape (Fig. 1) consistent with the ear- 
lier low-resolution structure. This 
shape was previously interpreted’ as a 
telomerase dimer — that is, each lobe 
was thought to contain a copy of the 
TERT and TR subunits. Nguyen et al. 
now propose a markedly different inter- 
pretation. They point out that the struc- 
ture of one of the lobes is consistent 
with high-resolution models’ of TERT 
and TR, which comprise the ‘catalytic 
core of the telomerase complex. They 
also compare their data with the crystal 
structure’ of an H/ACA RNP bound 
to a single ‘hairpin’ RNA, and postulate 
that the second lobe contains the H/ 
ACA RNP. Moreover, they propose that 
this H/ACA lobe includes a single copy 
of the TCAB1 protein, which regulates 
telomerase trafficking in the nucleus”. 
The bilobal architecture results from 
a remarkably extended RNA scaffold, 
consistent with the general model 
that TRs provide a flexible scaffold for 
telomerase-associated proteins”. The 
two distinct lobes interact through two 
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Figure 1 | Cartoon of the human telomerase enzyme. 
Telomerase enzymes catalyse the addition of short DNA- 
sequence repeats to protective caps (telomeres) at the ends of 
linear chromosomes. Nguyen et al.’ report a structure of human 
telomerase obtained using cryo-electron microscopy. The 
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another striking feature of the H/ACA 
lobe is that its proteins seem to promote 
a specific conformation of TR that brings 
the CR4/5 domain close to the catalytic 
core, where it is needed to support cataly- 
sis. This bears a remarkable resemblance 
to the T. thermophila telomerase RNP, in 
which an RNA-binding protein called 
p65 is required to remodel TR and pro- 
mote contacts with the catalytic TERT 
subunit’*'”. This resemblance suggests 
that telomerases from different organ- 
isms have co-opted different types of 
RNA-binding protein to promote RNP 
assembly. The idea that the H/ACA 
proteins direct RNA folding can be 
tested experimentally, using the new 
structural model as a framework with 
which to design and interpret incisive 
experiments. 

Nguyen and colleagues’ work sets a 
new bar for structural models of human 
telomerase, but there is still work to 
be done. A complete view of telomer- 
ase structure and function will require 
atomic-resolution models of the enzyme 
in its various functional states. Advances 
in cryoEM techniques, together with 
other structural and biophysical tools, 


bridging RNA helices. 
The discrepancies between Nguyen 


overall structure is bilobal, and consists of an extended RNA 
(blue) that acts as a scaffold for various proteins. The authors 
suggest that the H/ACA lobe — named after the H and ACA 


will no doubt continue to enhance our 
appreciation of the intricate workings of 


and colleagues’ structure and previous 
reports of the composition and organi- 
zation of human telomerase are probably 
rooted in the different methods used to 
purify the enzyme, which might yield 
distinct telomerase subcomplexes and, in 
some cases, promote formation ofa telo- 
merase dimer. Nevertheless, the present 
structure supports the hypothesis that 
telomerase from a range of organisms 
can function as a monomer’. 

In Nguyen and co-workers’ reconstruction 
of the catalytic core of human telomerase, the 
active site of the ring-shaped TERT subunit 
contains the template RNA bound toa DNA 
substrate. The two regions of TR required for 
catalytic function in vitro are known as the 
template/pseudoknot domain (t/PK) and the 
conserved regions 4/5 (CR4/5) domain. In 
the authors’ structure, the evolutionarily con- 
served pseudoknot fold in the t/PK domain 
resides on the ‘back’ side of the TERT ring, far 
from the active site. This is highly consistent 
with the pseudoknot’s location in the previ- 
ously reported structure’? of telomerase from 
the protozoan Tetrahymena thermophila, and 
suggests that this essential TR element indi- 
rectly promotes telomerase function, perhaps 
by guiding proper RNP assembly or by influ- 
encing the conformation of TERT subdomains 
during catalysis. 

The authors’ model of the H/ACA lobe 
is the first structure of a eukaryotic H/ACA 
RNP, and includes two complete sets of the 
four H/ACA proteins (dyskerin, NOP10, 
NHP2 and GAR1) bound to RNA hairpins. 


motifs within the RNA — contains two copies of the tetrameric 
H/ACA protein complex bound to two ‘hairpin’ regions of the 
RNA scaffold, as well as a TCAB1 protein. This lobe facilitates 
the assembly of the telomerase enzyme from its components, 
and controls trafficking of the enzyme in the nucleus. The other 
lobe is proposed to contain the enzyme's catalytic core, which 
includes the TERT protein, together with the pseudoknot and 
three-way junction of the RNA. The RNA sequence that is used 
as a template for the DNA repeats is also in the catalytic core, 
bound to a DNA substrate. 


The structure reveals interesting differences 
in the way in which each set of H/ACA pro- 
teins interacts with its respective hairpin. 
The foundation of the H/ACA lobe structure 
is made by dyskerin subunits, which bind 
to the base of each RNA hairpin and also 
make extensive contacts with each other. 
The remaining three H/ACA proteins seem 
to make numerous contacts with one of the 
RNA hairpins, but substantially fewer with 
the other. Lastly, the TCAB1 protein binds to 
a structure within TR known as the CAB box, 
found at the apex of the H/ACA lobe. These 
results provide a structural explanation for 
a wealth of previously reported biochemical 
data’*”* that defined the subunit stoichiom- 
etry, as well as the RNA-hairpin sequence and 
structure requirements for stable assembly, of 
the TR H/ACA RNP. 

The structural segregation of the catalytic 
core and H/ACA lobe raises the question of 
why the telomerase complex has retained the 
large H/ACA RNP throughout evolution. 
Part of the answer is likely to be that TCAB1 
is needed to mediate trafficking of telomerase 
components during RNP assembly. However, 


this fascinating enzyme. = 
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Diametric neural ensemble dynamics in 
parkinsonian and dyskinetic states 


Jones G. Parker!?, Jesse D. Marshall*>’, Biafra Ahanonu!, Yu-Wei Wu’, Tony Hyun Kim!, Benjamin F. Grewe’®, 
Yanping Zhang", Jin Zhong Li’, Jun B. Ding*, Michael D. Ehlers*®* & Mark J. Schnitzer!3* 


Loss of dopamine in Parkinson's disease is hypothesized to impede movement by inducing hypo- and hyperactivity 
in striatal spiny projection neurons (SPNs) of the direct (dSPNs) and indirect (iSPNs) pathways in the basal ganglia, 
respectively. The opposite imbalance might underlie hyperkinetic abnormalities, such as dyskinesia caused by treatment 
of Parkinson’s disease with the dopamine precursor L-DOPA. Here we monitored thousands of SPNs in behaving mice, 
before and after dopamine depletion and during L-DOPA-induced dyskinesia. Normally, intermingled clusters of dSPNs 
and iSPNs coactivated before movement. Dopamine depletion unbalanced SPN activity rates and disrupted the movement- 
encoding iSPN clusters. Matching their clinical efficacy, L-DOPA or agonism of the D2 dopamine receptor reversed 
these abnormalities more effectively than agonism of the D; dopamine receptor. The opposite pathophysiology arose in 
L-DOPA-induced dyskinesia, during which iSPNs showed hypoactivity and dSPNs showed unclustered hyperactivity. 
Therefore, both the spatiotemporal profiles and rates of SPN activity appear crucial to striatal function, and next- 
generation treatments for basal ganglia disorders should target both facets of striatal activity. 


According to the classical rate model of Parkinson's disease, dSPNs and 
iSPNs normally have balanced activity rates, and the loss of dopamine 
induces an activity imbalance that suppresses movement by enhancing 
basal ganglia inhibition of the thalamo-cortical motor system“. The 
premise of the rate model, that the direct and indirect pathways are 
functionally opposed, concisely explained multiple aspects of basal gan- 
glia function and dysfunction and led to descriptions of dystonia and 
Huntington's disease, Tourette’s syndrome’, dyskinesia‘, addiction’, 
chronic pain’, depression? and schizophrenia!” based on imbalanced 
SPN activity rates. However, without cellular-level recordings that 
enable the comparison of dSPN and iSPN activity levels, no definitive 
evidence ofa striatal activity imbalance has been shown in any of these 
conditions. 

Optogenetic studies support the idea that dSPNs and iSPNs have 
opposing roles in movement initiation'’. However, fluorescence record- 
ings have recently shown that both cell types normally activate at move- 
ment onset with similar time courses!*~!*. These results challenge the 
rate model and reinvigorate models in which the direct and indirect 
pathways coordinate action selection by concurrently activating and 
suppressing competing motor programs, respectively!, The two path- 
ways could still be opposed in some manner, but the oversimplicity of 
the rate model may underlie its inability to explain several notable fea- 
tures of Parkinsonr’s disease!®, including the loss of movement-specific 
activity in nuclei downstream of the striatum!’. Therefore, imbalanced 
SPN dynamics might only partially account for the motor symptoms 
of Parkinson's disease. 

L-DOPA is the mainstay therapy for treating these symptoms, but 
whether it alters SPN activity and why it is superior to dopamine- 
receptor agonism remain unknown’. Notably, the efficacy of L-DOPA 
gradually decreases, as patients increasingly exhibit involuntary move- 
ments termed L-DOPA-induced dyskinesia (LID). Effective alterna- 
tives are needed to delay L-DOPA treatment and supplement or replace 
it upon loss of efficacy. LID is not fully understood, but might involve 
an activity imbalance opposite to that found in Parkinson's disease, 


with dSPN hyperactivity!®. However, as with Parkinson's disease direct 
proof for these ideas has been lacking. To examine these issues, we com- 
pared dSPN and iSPN dynamics in freely behaving mice, under normal 
and parkinsonian conditions, before and after common treatments for 
Parkinson's disease and during LID. 


Striatal encoding of movement 

To selectively monitor dSPNs or iSPNs in the dorsomedial striatum, 
we respectively injected Drdla‘’ (Drd1a is also known as Drd1) 
or Adora2a™ driver mice with a virus mediating Cre-dependent 
expression of the green-fluorescent Ca”t-indicator GCaMPom”™! 
(Methods). Control studies in live striatal tissue slices confirmed that 
somatic Ca?+ dynamics report spiking equivalently in dSPNs and iSPNs 
(Extended Data Fig. 1a-i). 

To track SPN activity over weeks without affecting locomotor 
behaviour, we used a head-mounted microscope”? and time-lapse 
microendoscopy™ to image SPN dynamics as mice explored an 
open arena (Fig. 1 and Extended Data Fig. 1), k). We computation- 
ally extracted activity traces of individual SPNs from the Ca** movies 
(262 +11 dSPNs (mean +s.e.m.), 277 + 8 iSPNs per mouse per session; 
369 total 1-h-long sessions in 17 Drdla‘” mice, 21 Adora2a“” mice). 
Ca’* transient events occurred at similar rates for dSPNs and iSPNs 
and had waveforms consistent with those in striatal slices (Extended 
Data Fig. 11-n). 

During movement, Cat event rates rose markedly in both SPN 
types with similar dependencies on locomotor speed (Fig. 2a, b and 
Supplementary Videos 1, 2). Averaged over all Ca?* events, locomo- 
tor speed rose after Ca”* excitation with indistinguishable latencies 
in dSPNs and iSPNs (192 +9 ms (mean £s.e.m.) and 196 +11 ms, 
respectively; P=0.8; rank-sum test). Averaged over all movement 
bouts, mean SPN activity rose approximately 1s before motion onset 
(Fig. 2c), with mean times to reach the half-maximum amplitude that 
were indistinguishable given the approximately 40-ms resolution of 
our statistical analysis (Drdla‘ mice, —0.53 +0.04s (mean +s.e.m.) 
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Fig. 1 | Ca”* imaging of SPNs in freely behaving mice. a, Timeline of 
Ca’* imaging sessions, 6-OHDA infusion and drug treatments. SKF, 
SKF81297. b, An Inscopix miniature microscope and microendoscope 
imaged SPN activity patterns. Infusion of 6-OHDA into substantia nigra 
(SNc) on day 0 ablated dopamine cells. c, Somata (red) of 573 dSPNs and 


relative to motion onset, 17 mice, 5,372 cells, 2,899 movement bouts; 
Adora2a™ mice, —0.48 + 0.04, 21 mice, 5,880 cells, 3,559 movement 
bouts; P=0.7; rank-sum test). 

SPN activity was markedly clustered during movement, although the 
clusters of coactive cells were not absolutely stereotyped'*. The spatial 
coordination of coactive SPNs rose and fell at motion onset and offset, 
respectively, with time courses that resembled those of the Ca”* activity 
(Fig. 2c and Extended Data Fig. 2). Cell pairs within 100 1m or less were 
especially coactive, but beyond this distance pairwise activity correla- 
tions gradually decreased (Fig. 2d, e). By comparing the sets of cells 
that were activated during five different types of movement, we found 
that SPNs activated during bouts of the same movement type were, on 
average, more similar and closer together than those activated during 
different movement types'* (Extended Data Fig. 3 and Methods). 

To examine how dSPN and iSPN activity patterns were inter-related, 
we imaged their concurrent dynamics by two-photon microscopy in 
head-fixed mice in which both SPN-types expressed GCaMP6m and 
only dSPNs expressed the red-fluorescent marker td Tomato (Extended 
Data Fig. 4 and Supplementary Video 3). As the mice ran in place, dSPN 
and iSPN dynamics tracked locomotor behaviour indistinguishably. 
Nearby (20-100 1m) SPNs were equally coactive with other SPNs 
regardless of type, and the two cell populations encoded running speed 
equivalently. 


Striatal activity after dopamine depletion 

We next examined SPN dynamics in a classic model of Parkinson's 
disease”’. Ipsilateral to the imaged striatum, we lesioned dopamine 
cells of the subtantia nigra pars compacta (SNc) using the neurotoxin 
6-OHDA. Immunostaining for tyrosine hydroxylase, a marker of 
dopamine neurons, confirmed that the lesioned SNc lacked dopamine 
cell bodies 24h after 6-OHDA infusion (Extended Data Fig. 5a, b). 
Following Ca’* imaging, immunofluorescence intensities in the ipsi- 
lateral striatum more than 17 days after 6-OHDA infusion were 6 + 1% 
(mean +s.e.m.) of those contralateral to the lesion (P=6 x 107°; n=25 
mice; signed-rank test; Fig. 3a), verifying loss of dopamine neuron 
axons. 

One day after the mice received 6-OHDA, locomotion was reduced 
relative to mice that had received saline infusions (Fig. 3b, c and 
Extended Data Fig. 5c). During Ca** imaging, background fluores- 
cence was also reduced, but with almost no effect on the detection 
of Ca** events or SPNs (Extended Data Fig. 5d—h). Consistent with 
the rate model of Parkinson's disease, post-lesion (1 day) activity rates 
during rest and locomotion were decreased in dSPNs and increased 
in iSPNs compared to pre-lesion values (Fig. 3d-f and Extended Data 
Fig. 6). These effects resembled those of acute treatment with either D, 
dopamine receptor (D1R) or D2 dopamine receptor (D2R) antagonists 
(Extended Data Fig. 7), which can cause parkinsonism in humans”®. 

By 14 days after the lesion, a time point that may be more represent- 
ative of the sustained dopamine depletion in patients with Parkinson's 
disease”’, locomotor activity had risen but remained below normal 
levels (Fig. 3c). iSPN activity remained elevated at baseline but barely 
rose at movement onset, no longer reflected locomotor speed and had 
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356 iSPNs in example Drdla‘™ (left) and Adora2a™ (right) mice, atop 
mean fluorescence images of the dorsomedial striatum. Scale bar, 125 1m. 
d, Example Ca?" activity traces in mice exploring an open arena. Grey 
shading, running periods; white shading, resting periods. 


lost nearly all of its spatial coordination (Fig. 3d-j). These abnormal- 
ities were also present during a rotarod assay of motor coordination 
(Extended Data Fig. 6c—e). The iSPN ensembles activated during dif- 
ferent types of movement also became less distinctive after dopamine 
depletion, indicating reduced encoding specificity (Extended Data 
Fig. 8). By comparison, dSPN activity rates remained consistently 
depressed but still rose at motion onset and retained spatial coordi- 
nation, a clear relationship to locomotion and selectivity for specific 
movement types (Fig. 3d-j). Thus, between acute and extended dopa- 
mine depletion, iSPN but not dSPN ensembles underwent pronounced 
changes in how their spatiotemporal dynamics encoded locomotion. 


Treatment of aberrant neural dynamics 
We next examined whether dopamine replacement or receptor agonism 
could reverse these aberrant SPN dynamics. Across days 15-20 after 
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are normalized to mean values in resting mice. Kinks in speed traces at 
motion onset/offset reflect the definition of a motion bout, for which the 
minimum speed to initiate a bout exceeds that to sustain one. d, Mean 
pairwise coactivity during locomotion versus the separation of cell pairs, 
normalized to values in temporally shuffled datasets (dashed line). Cyan, 
proximal (20-100 1m) cell pairs. e, Coactivity of proximal cell pairs 
exceeded that of the activity in shuffled datasets (**P <5 x 1074; signed- 
rank test comparing real and shuffled data; n =17 Drdla‘”’, 21 Adora2a° 
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Box-and-whisker plots in all other figures are formatted identically. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


e ORest Moving f 


Intact 


b j : 
7 ee: 


Lesion 


== Pre-lesion 


Normalized © 


distance travelled 


a 8 a 

ES E 

o@ o 

° 5 2 

d F B3 i 
g dSPNs iSPNs 2G 2 
gf 0.05Ca 353 g 
or 4 ee 2 o 
=o events s a7 2 &, 
@ | | per cell O° l/ io) 


m Pre-lesion mii day mi14 days 1 min 


9 123 4 
Locomotor 
speed (cm s“') 


Locomotor 
speed 


Fig. 3 | Dopamine loss differentially alters dSPN and iSPN activity 
patterns. a, Brain slices were immunostained for tyrosine hydroxylase 
(red) >14 days after unilateral 6-OHDA lesion and show unilateral loss 
of dopamine cell bodies and axons. White lines, brain area boundaries*®. 
DAPI was used to stain the nuclei (blue). Scale bars, 1 mm. b, Example 
locomotor trajectories (15 min), before 6-OHDA lesion and 1 and 14 days 
after the lesion. c, Distance travelled, normalized for each mouse to its pre- 
lesion value. **P <3 x 10-3 compared to pre-lesion; *P = 0.013 comparing 
1 day to >14 days; signed-rank test, n = 25 mice. d, Traces of locomotor 
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6-OHDA lesion. e, Mean Ca?* event rates versus speed, before and after 
6-OHDA lesion, normalized to mean rates at rest (<0.5cms~') before 

the lesion. f, Ca** event rates (mean £s.e.m.) before and after 6-OHDA 
lesion, normalized to pre-lesion values. ***P <5 x 10717; signed-rank 
test comparisons to pre-lesion; n = 12 Drdla‘”, 13 Adora2a™ mice. NS, 
not significant. g, Mean Ca?* event rates (top), SPN spatial coordination 
(middle) and locomotor speed (bottom) relative to motion onset and 
offset, before and 14 days after 6-OHDA lesion. Given their similarity 
(Fig. 2c), pre-lesion traces for both SPN-types were combined. Traces are 
normalized to mean values from —2s to —1s relative to motion onset. 
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(P=10-*! (Drdla™); P=10~'!! (Adora2a“”)), but was more profoundly 
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6-OHDA infusion, we successively administered 1 or 6mgkg™! ofa 
D2R agonist (quinpirole), a D1R agonist (SKF81297), or L-DOPA. As in 
past work using the 6-OHDA model of Parkinson's disease, these agents 
induced locomotor turning contralateral to the lesioned SNc?>, to equi- 
potent levels within the tested dosage ranges (Fig. 4a and Extended 
Data Fig. 9a-e). 

All three drugs also increased dSPN activity, unexpectedly so for 
quinpirole, given the lack of D2Rs on dSPNs”®”? (Fig. 4b-d). Quinpirole 
and the 6 mgkg~! 1-DOPA dose, but not SKF81297, further amelio- 
rated the activity imbalance by reducing the elevated iSPN activity 
in resting mice. The two agonists had divergent effects from those in 
normal mice, in which they both unexpectedly depressed SPN activity 
(Extended Data Fig. 7d-fand Supplementary Note). Given the unilat- 
eral lesion and systemic drug administration, drug effects in the con- 
tralateral striatum or even outside striatum could contribute to these 
observations, although the brain’s greatest densities of dopaminergic 
inputs and dopamine receptors are in the striatum*”. 

Beyond rebalancing SPN activity, - DOPA mostly restored the cou- 
pling to locomotion and spatial clustering of iSPN activity (Fig. 4b, e-i). 
D2R agonism failed to restore the iSPN activity rise at motion onset, 
whereas D1R agonism rescued neither the elevated rates nor spatial 
clustering of iSPN activity (Fig. 4c, f-i). Thus, although L-DOPA and 
both agonists reversed the activity imbalance, only L-DOPA reversed 
the deficits in both motion-related iSPN activity and its spatial coordi- 
nation. These differences in treating neural abnormalities arose even 
at doses equally affecting turning behaviour, the standard metric of 
therapeutic efficacy in this model of Parkinson's disease”®. 

Late in the treatment of Parkinson's disease, patients commonly 
develop LID, which we modelled by giving parkinsonian mice a 
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mouse lines at both time points. Spatial coordination of iSPN (P= 107°) 
but not dSPN activity (P = 0.4) decreased on day 14 to below pre-lesion 
values. n = 1,326-2,957 bouts with >2 active cells, for both mouse lines 
and time points; rank-sum test. h, Mean speeds relative to occurrences 

of Ca?+ events, normalized to values 2s before Ca”+ excitation. At 14 
days after 6-OHDA lesion, the mean speed rise within +500 ms of a 

Ca?* event was 0.8 + 0.3% (mean +s.e.m.) for iSPNs, 11 + 1% for dSPNs. 
n= 3,498-5,529 cells per day, for both mouse lines. i, Rises in Ca?* event 
rates at motion onset, based on data from g. For each group, we averaged 
over all movement bouts and compared the event rates during the intervals 
—2 to —1s before and 0-1s after motion onset. Values normalized to the 
pre-lesion values. *P= 0.03; **P=9 x 1077; rank-sum test compared to 
pre-lesion values; n = 12 Drdla“* and 13 Adora2a‘™ mice. j, Coactivity 
values in proximal SPN pairs (20-100 1m apart) during locomotion, after 
subtraction of coactivity values in shuffled datasets. Values normalized 

to pre-lesion values. **P <2 x 1074, ***P<4 x 10~; rank-sum test 
comparing pre-lesion to 1 day and > 14 days; n = 96-416 coactivity values 
for both mouse lines and all three time points. c, e-j, Data are from the 
same 12 Drdla™ and 13 Adora2a™ mice. e, g, h, Colour shading indicates 
the s.e.m. Pre-lesion data were acquired on day —5. Data for > 14 days 
were from 1-h recordings on day 14 plus 30-min recordings on days 15, 
17 and 20 performed after saline vehicle administration but before drug 
treatment. 


dyskinesiogenic (10 mgkg~') L-DOPA dose*! that induced abnormal 
facial, limb and trunk movements (Extended Data Fig. 9f-j). 
During dyskinesia, dSPNs were hyperactive and iSPNs hypoactive. 
Furthermore, dSPNs lost their spatial coordination and movement- 
evoked responses, whereas iSPNs became more responsive to motion 
onset (Fig. 5). Thus, the signature dSPN and iSPN dynamics in 
parkinsonian and dyskinetic states were diametric opposites of each 
other (Supplementary Video 4). This anti-symmetry provides an 
attractive explanation for the opposite motor symptoms of Parkinson's 
disease and LID. 


Implications for basal ganglia function 

Our data show dSPNs and iSPNs normally have indistinguishable 
kinetics at motion onset and offset’? (Fig. 2), challenging the idea 
that the two cell types respectively mark movement start and end”. 
Activation patterns in the two SPN types were comparably clustered, 
consistent with previous reports!*4, Supporting one recent study’, 
the clusters of active SPNs that we identified were neither absolutely 
stereotyped nor spatially compact. 

The ensemble activity that we observed in both SPN types precisely 
encoded locomotor speed (Extended Data Fig. 41, m) and as in recent 
work’ arose in clusters that preferentially encoded specific movement 
types (Extended Data Fig. 3, 10). Simultaneous Ca** imaging in both 
SPN types revealed interspersed patterns of dSPN and iSPN activity 
(Extended Data Fig. 4j, k). Together, these findings pose a challenge 
to ‘suppression-selection’ models", in which the main iSPN function 
is to broadly suppress competing actions other than the one selected. 
Such models predict that isPN dynamics should be less selective, more 
diffuse and perhaps less velocity-sensitive than those of dSPNs. Instead, 
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it appears that iSPNs and dSPNs jointly engage in movement selection 
via synchronized activation in interspersed cell clusters of comparable 
size coupled indistinguishably to locomotion. The data here come from 
the dorsomedial striatum, and future work should explore whether the 
findings generalize to other striatal regions. 

The spatial extent of dSPN and iSPN coactivation resembles that 
of corticostriatal neurons with focal axonal arborizations (around 
100-300 pm wide) within the rodent striatal matrix’. Striatal areas 
receiving these projections are termed matrisomes and have soma- 
totopically organized cortical inputs and projections to basal ganglia 
output nuclei**. Thus, the spatiotemporal organization of cortical input 
and the convergence of signals from the direct and indirect pathways 
are probably both vital for sculpting the motor output signals of the 
basal ganglia'>**"° (Extended Data Fig. 11). 


Signatures of a parkinsonian state 

The notion that an excess or deficiency of dopamine signalling unbal- 
ances striatal dynamics has long provided the conceptual basis for treat- 
ing basal ganglia disorders'*>~'°, but direct evidence of imbalanced 
striatal activity has been lacking. We uncovered opposite imbalances 
in dorsomedial striatum after dopamine depletion and LID, and addi- 
tional facets of SPN pathophysiology that probably contribute to the 
symptoms of Parkinson's disease, their reversal by dopamine restora- 
tion, and the hyperkinesia of LID. 

We found four distinct signatures of a parkinsonian state: reduced 
spontaneous dSPN activity; increased spontaneous iSPN activity; 
reduced coupling of iSPN activity levels to motion onset and offset; and 
decreased spatial clustering of iSPN activity (Extended Data Fig. 11). 
Collectively, these signatures expose the incompleteness of the rate 
model. Notwithstanding, these findings support the models predictions 
for how spontaneous rates of SPN activity change in Parkinson's dis- 
ease. Abnormal rates of spontaneous SPN activity may indeed impede 
movement by increasing activity in the inhibitory output projections 
of the basal ganglia. The loss of spatially clustered iSPN activity and 
its coupling to movement lie beyond the rate model and probably 
also contribute to parkinsonian symptoms. These unexpected deficits 
might indicate disruptions to matrisome*** function that preclude 
the spatiotemporal dynamics required in basal ganglia output nuclei 
for well-choreographed actions. The added deficits did not arise after 
acute dopamine depletion or D1R or D2R antagonism and are consist- 
ent with electrophysiological recordings in a genetic mouse model of 
progressive dopamine cell loss**. 

These progressive pathophysiologic changes probably reflect neural 
adaptations that occur during chronic dopamine depletion, many of 
which preferentially affect iSPNs, such as reduced dendritic spine den- 
sity and increased input from inhibitory interneurons (Supplementary 
Note). Notably, the long-term adaptations in SPN dynamics were pre- 
cisely those best treated by -DOPA and D2R agonism. The mech- 
anisms by which these treatments immediately counteracted the 
long-term facets of SPN pathophysiology are unclear but important to 
identify, because the capabilities of the different compounds for revers- 
ing the neural abnormalities matched their clinical utilities'® (in order 
of efficacy: L-DOPA, D2R agonism, D1R agonism) and might underlie 
the varying utility of dopamine agonist monotherapy versus that of 
L-DOPA in different stages of Parkinson's disease*”. 


Anti-symmetric deficits in LID 

Notably, during LID the dorsomedial striatum exhibits diametric 
abnormalities to those in the parkinsonian state. Dyskinesia is charac- 
terized by decreased spontaneous iSPN activity, increased spontaneous 
dSPN activity and a loss of spatially clustered and movement-coupled 
dSPN activity. These features are the polar opposite of those of the 
parkinsonian state and may explain the opposite motor symptoms of 
the two conditions. They also fit with reported cellular adaptations in 
Parkinson's disease versus LID, including reductions of dendritic spines 
on iSPNs and dSPNs, respectively!*””. A caveat is that we examined a 
model of acute LID, which differs from models with more prolonged 
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mice at rest (<0.5cms-') after saline injection. Colour shading indicates 
s.e.m. b-h, Data are from the same 7 Drd1a‘"’, 6 Adora2a“ mice. 

e, f, Ca?* event rates when mice were resting (e) or moving (f). Values 

in e-h were normalized for each mouse to measured values after 

saline treatment and 6-OHDA lesion (dashed lines). n = 14 speed bins 
<0.5cms!, n=24 speed bins >0.5cms~!, n=7 Drdla™, 6 Adora2a’ 
mice. g, Increases in Ca?* event rates at motion onset. For each treatment, 
we averaged over all motion bouts and computed the mean rate 0-2 after 
motion onset, normalized to rates from —2 to —1s before motion onset 
(dashed lines). n= 11 time bins per mouse per treatment. h, Coactivity in 
proximal SPN pairs (20-100 jm apart) during movement, after subtracting 
coactivity values in shuffled datasets. n = 8 bins of cell-cell separation, 
n=7 Drdla‘™, 6 Adora2a™ mice. i, Summary of effects. Arrows indicate 
mean changes of >10% at the 6mg kg! dose of each drug. Encircled 
arrows indicate effects unpredicted by the rate model of Parkinson's 
disease. Statistical tests are signed-rank tests with Dunn-Sidak correction 
for multiple comparisons. tP = 2.4 x 10~> for comparisons to pre-lesion; 
*P<6x 1077, **P <6 x 10-4, ***P <6 x 1077 for comparisons to vehicle 
treatment; #P < 0.05, ##P <5 x 10-3, ###P <5 x 10-7 for comparisons 
between drug doses. Pre-lesion data were acquired on day —5. Vehicle 
data were aggregated for 30-min recordings on days 15-21 after saline 
administration but before drug treatment. 
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Fig. 5 | In LID, iSPNs are hypoactive and dSPNs have uncorrelated 
patterns of hyperactivity. a, Turning bias (mean + s.e.m.) and abnormal 
involuntary movement score (AIMS) within 40 min after saline vehicle 
or L-DOPA injection, >14 days after 6-OHDA lesion. ##P < 10? for 
comparisons of contralateral bias to vehicle treatment; signed-rank test; 
*P=10-* and **P= 10-4 comparing AIMS values to those after vehicle 
treatment; rank-sum test; n = 13 mice. 10 mg kg"! Lt-DOPA induced 
higher AIMS values than 6 mgkg~'. #P = 0.013; signed-rank test. b, Peak 
AIMS (mean +s.e.m.), separated by movement type, within 40 min after 
vehicle or L-DOPA treatment in mice >14 days after 6-OHDA lesion. 

*P < 0.05, **P <5 x 10-3 for comparisons to vehicle treatment; rank-sum 
test; n = 13 mice. c, Mean SPN Ca?* event rates versus locomotor speed 
before 6-OHDA lesion, and >14 days after the lesion following either 
vehicle or 10mgkg~! 1-DOPA treatment. Values are normalized to mean 
rates when mice were resting (<0.5cms_') after vehicle treatment. Grey 
shading indicates >0.5cms~' locomotion. Colour shading indicates 

the s.e.m. c-g, Data are from the same 7 Drdla‘”, 6 Adora2a™ mice. 

d, e, Ca”* event rates when mice were resting (<0.5cm s~!; d) or moving 
(>0.5cms~1; e). All values in d-g are normalized to those after vehicle 
treatment in lesioned mice (dashed lines). Statistical tests in d-g are 
signed-rank tests with Dunn-Sidak correction for multiple comparisons. 
*P<6x 10-3, **P<6 x 10 4, ***P <6 x 10” for comparisons to vehicle 
treatment. n= 14 speed bins <0.5cms_!, 24 speed bins >0.5 cms"! for all 
13 mice. f, Rises in Ca** event rates at motion onset. For each treatment, 
we averaged over all motion bouts and computed the mean rate 0-2 s 
after motion onset, relative to the values from —2 to —1s before onset. 
n=11 time bins per mouse per treatment. g, Coactivity in proximal SPN 
pairs (20-100 m apart) during movement, after subtracting coactivity 
values in shuffled datasets. n = 8 bins of cell-cell separation per mouse. 
h, Summary of effects. Arrows indicate mean changes > 10%. Encircled 
arrows indicate effects unpredicted by the classical rate model. Pre-lesion 
data were acquired on day —5. Vehicle data aggregates 30-min recordings 
on days 15-21 were acquired after vehicle administration but before drug 
treatment. 


courses of L-DOPA treatment known to induce a range of cellular 
adaptations”’. 

As with uncorrelated iSPN hyperactivity in the parkinsonian state, 
uncorrelated dSPN hyperactivity in LID probably also disrupts move- 
ment selection, but in this case causing excessive, unplanned move- 
ments. However, it remains unclear whether and how dSPN activity 
patterns relate to specific dyskinetic movements. Just as pharmacolog- 
ical restoration of spatially coordinated iSPN activity was associated 
with clinical efficacy in treating the symptoms of Parkinson's disease, 
reversing or preventing the loss of spatially coordinated dSPN acti- 
vity in LID might benefit patients with late-stage Parkinson's disease. 
Overall, the distinctive neural signatures of Parkinson’s disease and 
LID established here may provide biomarkers facilitating development 
of substitutes for L-DOPA or treatment regimens extending its period 
of efficacy. 


ARTICLE 


Beyond Parkinson's disease and dyskinesia, our results provide the 
impetus and means for testing whether other basal ganglia disorders 
unbalance striatal activity and disrupt its coding dynamics*’""”. As 
with Parkinson's disease and LID, it is probable that ideas beyond the 
rate model framework will be required to explain the spectrum of 
basal ganglia abnormalities and patient symptoms. Current treatment 
approaches often aim to correct activity imbalances, but our results 
indicate that next-generation therapies should also target the concomi- 
tant biological adaptations that disrupt striatal neural ensemble coding. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 

Mice. We obtained the GENSAT BAC-transgenic Cre-driver mouse lines, Drdla™’ 
(FK150) and Adora2a™ (KG139), from the Mutant Mouse Research & Resource 
Centers (https://www.mmrrc.org). We crossed these mice with wild-type C57BL/6 
mice. We inbred the resulting heterozygous offspring to generate homozygous 
BAC-transgenic animals, which we maintained through further homozygous 
crosses. To generate mice for our two-photon imaging experiments, we crossed 
homozygous Drdla* with homozygous reporter mice (Ail4 (also known as 
Gt(ROSA)26Sor'"!4(CAG-tdTomato)Hze mice); Allen Brain Institute) expressing the 
red fluorophore tdTomato in a Cre-dependent manner. We used both male and 
female mice, aged 12-24 weeks at the start of behavioural experiments. The 
Stanford Administrative Panel on Laboratory Animal Care (APLAC) approved all 
experiments. 

Virus injections. For miniature microscope and brain slice experiments, we 
used AAV2/9-CAG-FLEX-GCaMP6m-WPRE.SV40 (1.37 x 10° genome copies 
(GC) pl7}; Penn Vector Core). For two-photon imaging experiments, we used the 
Camk2a promoter to drive GCaMP expression (AAV2/9-CaMKII-GCaMP6m; 
1.8 x 10° GC ul’) in both SPN types. To inject the virus, we anaesthetized mice 
with isoflurane (2% in O2) and stereotaxically injected virus into the dorsomedial 
striatum (anteroposterior (AP): 1.0mm from bregma; mediolateral (ML): 1.5mm 
from midline; dorsoventral (DV): —2.7 mm (down to —3.2 mm then back up to 
—2.7mm from dura)). The injection needle had a 33-gauge bevelled tip. We 
injected 500 nl of virus at a rate of 250 nl min~!. After the injection was done, 
we waited 10 min and then withdrew the syringe. We recorded the bregma-dura 
distance for subsequent positioning of the optical guide tube, sutured the scalp, and 
allowed the mice to recover for a week before implanting the guide tube. 

In vitro electrophysiological and Ca”* imaging studies in striatal tissue slices. 
Using established techniques, we performed dual Ca?* imaging and electro- 
physiological studies of SPN dynamics in live brain slices obtained from Drdla™ 
and Adora2a™ mice >3 weeks after virus injection. The numerical aperture of the 
imaging system was matched to that of the miniature microscope used in freely 
behaving mice. We imaged striatal Ca”* transients at a frame acquisition rate of 
20 Hz, using a scientific-grade CMOS camera (ORCA-Flash 4.0 LT, Hamamatsu 
Photonics) and MicroManager software (NIH)*®. Concurrently, we performed 
whole-cell current-clamp recordings of SPN activity using a patch pipette 
(3-5 MQ) and a Multiclamp 700B (Molecular Devices) patch-clamp amplifier. 
Additional details can be found in the Supplementary Methods. 

Surgeries. To prepare mice undergoing 6-OHDA lesions and imaging, we anaes- 
thetized mice with isoflurane (2% in O2) and used a 0.5-mm-diameter drill bit to 
create a craniotomy dorsal to the SNc (AP = —3.5mm; ML= 1.25 mm) for inser- 
tion of a 26-gauge guide cannula (2.5-mm-long projection, Plastics One, Inc.). 
We drilled three additional small holes (0.5-mm diameter) at spatially distributed 
locations for insertion of stabilizing skull screws. 

We used a 1.4-mm-diameter drill bit to create another craniotomy 
(AP= 1.0mm; ML= 1.5mm) for implantation of the optical guide tube. We fab- 
ricated this guide tube by using ultraviolet liquid adhesive (Norland #81) to fix a 
2.5-mm-diameter disc of #0 glass to the tip of a 3.8-mm-long, extra-thin 18-gauge 
stainless steel tube (McMaster-Carr). We ground off any excess glass using a poli- 
shing wheel (Ultratec). Using a 27-gauge blunt-end needle, we aspirated the cortex 
down to DV = —2.1 mm from the dura and implanted the exterior glass face of 
the optical guide tube at DV = —2.35 mm. We lowered the SNc guide cannula to 
DV =~—1.0mm from the dura. After stereotaxic placement of these components, 
we applied Metabond (Parkell) to the skull and fixed the full assembly using dental 
acrylic (Coltene). After the acrylic hardened, we used a second batch of acrylic to 
attach an aluminium head bar to the cranium, for the purpose of head-fixing the 
mice during subsequent experimental procedures. Mice recovered for 3-4 weeks 
before two-photon imaging experiments or mounting of the miniature microscope. 
Mounting of the miniature microscope. At 3-4 weeks after guide tube implan- 
tation, we assessed the extent of Ca”* indicator expression in the dorsomedial 
striatum. To do this, we inserted a gradient refractive index (GRIN) lens (1 mm 
diameter; 4.12 mm length; 0.46 numerical aperture; 0.45 pitch; GRINTECH GmbH 
or Inscopix Inc.) into the optical guide tube and used a commercial two-photon flu- 
orescence microscope (Ultima, Bruker) to image the striatum of each mouse while 
it was anaesthetized with isoflurane (2% in O2). In mice with uniform indicator 
expression, we secured the GRIN lens in the guide tube with ultraviolet (UV)-light 
curable epoxy (Loctite 4305). 

After affixing the GRIN lens, we lowered a miniature microscope (nVistaHD, 
Inscopix Inc.) towards the GRIN lens until the fluorescent tissue was in focus. To 
secure the miniature microscope to the cranium, we created a base on the cranium 
around the GRIN lens using blue-light curable resin (Flow-It ALC; Pentron). We 
attached the base plate of the miniature microscope to the resin base using UV-light 
curable epoxy (Loctite 4305). After affixing its base plate, we released the micro- 
scope and attached a base plate cover (Inscopix Inc.). We coated the resin with 
black nail polish (Black Onyx, OPI) to make it opaque. 
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6-OHDA infusions. To block serotonin and noradrenaline transporters and 
thereby prevent lesioning of non-dopamine monoamine neurons, we injected 
mice with desipramine (25mgkg'; ip.) 30 min before 6-OHDA infusion*!. We 
anaesthetized the mice with isoflurane (2% in O2) and stereotaxically inserted 
into the craniotomy above the SNc (see ‘Surgeries’) an injection cannula (33 
gauge, 17-mm-long) that was connected to a syringe pump loaded with 6-OHDA 
(4jgpl-! in saline). We sequentially infused 6-OHDA (650 nl at 100 nl min~!) at 
three sites: DV = —4.2 mm, —4.0 mm and —3.8 mm below the dura. We withdrew 
the injection cannula 10 min after the final infusion. After infusion, we moni- 
tored the body mass of each mouse and subcutaneously administered 1 ml lactated 
Ringer’s solution to prevent dehydration. In the first 3 days after 6-OHDA infusion, 
we provided the mice with manually crumbled mouse chow, to aid recovery. 
Pharmacology. We dissolved the D1R agonist (SKF81297; 1, 3 or 6 mgkg~), DIR 
antagonist (SCH23390; 0.2 mgkg~'), D2R agonist (quinpirole; 1, 3 or 6mgkg~'), 
D2R antagonist (raclopride; 1 mgkg~'), or L-DOPA (1, 6 or 10mgkg~’) mixed 
with benserazide (15 mgkg~') in 0.9% saline solution (all compounds from Sigma- 
Aldrich). We delivered all drugs systemically to the mice via intraperitoneal injec- 
tion (10 ml kg! injection volume). We selected doses based on published receptor 
occupancy data in systemically injected rodents!“ and our own evaluations of 
the dose-dependent effects of these drugs on behaviour in hemiparkinsonian mice 
(Extended Data Fig. 9). Specifically, we aimed to select doses with overlapping 
ranges of behavioural potency to test during Ca”* imaging (Fig. 4). Note that 
SCH23390 does bind to 5HT2 receptors, but its receptor occupancy level on SHT2 
receptors is 40-50 times less than that of D1Rs at the dose we used*>“°. For all 
Ca*+t imaging sessions, we waited 15 min after drug injections before recording 
neural activity for 30 min. See Extended Data Fig. 9 for the time-dependence of 
drug efficacy. 
Experimental timeline. Before the lesion, we administered mice the D1R antag- 
onist SCH23390 (day —4, 0.2 mgkg~'), the D2R antagonist raclopride (day —3, 
1mgkg'), the DIR agonist SKF81297 (day —2, 1mgkg'), and the D2R agonist 
quinpirole (day —1, 1 mgkg~!). After the lesion, we administered quinpirole (day 15, 
Imgkg~’; day 16, 6mgkg~’), SKF81297 (day 17, 1mgkg~'; day 18, 6mgkg~'), 
and the dopamine precursor L-DOPA (day 19, 1mgkg™!; day 20, 6mgkg™!; day 21, 
10mgkg™'). On days on which mice received drug treatments, we performed 
30 min of Ca** imaging before administering the drug, and another 30 min of 
Ca*+ imaging afterward. 
Ca?* imaging in freely moving mice. Brain imaging in freely moving mice 
occurred in a circular arena (31 cm in diameter). To habituate mice to this arena, 
we allowed them to explore it for 1h on each of three sequential days before any 
Ca?* imaging. Prior to each imaging session (Fig. 1a), we head-fixed each mouse by 
its implanted head bar and allowed the mouse to walk or run in place on a running 
wheel. We then attached the miniature microscope and adjusted the focal setting 
to optimize the field-of-view, in the first imaging session, or in subsequent sessions 
to match the field-of-view to that of all prior sessions (Extended Data Fig. 1)). 
After securing the microscope to the head of the mouse, we detached the mouse 
from its head restraint and allowed it to freely explore the circular arena. After 
allowing > 10min for the mouse to habituate to the arena, fluorescence Ca** imag- 
ing commenced using 50-200,1W of illumination power at the specimen plane and 
a 20-Hz frame acquisition rate. We also acquired 20-Hz movies of the behaviour 
of each mouse using a TTL-triggered video camera (SVWGAM; EPIX Inc.), a 
varifocal lens (T3Z2910CS; Computar), an image frame grabber card (PIXCI-SI; 
EPIX Inc.) and image-acquisition software (XCAP; EPIX Inc.). We concurrently 
recorded behavioural and Ca”* imaging data in sessions generally lasting 60 min in 
total. On days on which the mouse received drug treatments (Fig. 1a), we gave the 
mouse an injection of saline vehicle, waited 5 min, and then performed 30 min of 
Ca?* imaging. We then administered the drug, waited 15 min for the drug to take 
effect, and then acquired another 30 min of Ca** imaging data. 
Two-photon Ca”+ imaging in behaving mice. We acquired volumetric Ca”* 
imaging data from four different axial planes in the brains of head-fixed mice free 
to behave on a running wheel. The mice expressed the red fluorophore tdTomato 
in dSPNs and GCaMP6m in both SPN types (see ‘Mice’). After implantation of an 
optical guide tube and GRIN microendoscope in the dorsomedial striatum (see 
‘Surgeries’), we head-fixed the mice beneath the microscope objective lens carrier 
of a custom-built two-photon microscope*”. The microscope was equipped with a 
resonant galvanometer laser-scanning mirror (CRS 8 K, Cambridge Technologies), 
allowing a 30-Hz frame acquisition rate (512 x 512 pixels) under the control of 
ScanImage 5.2 software*® (Vidrio Technologies). Additional details can be found 
in the Supplementary Methods. 
Rotarod assay. Before day —5 (Fig. 1a), we trained mice for 3-4 days on an 
accelerating rotarod (EzRod; Omnitech) to ensure that they could remain on 
the rotarod. Each mouse performed three trials per day, with the rod accelerat- 
ing from 6 to 60 r.p.m. in 3 min. During subsequent rotarod sessions (days —5, 
1 and 14), we imaged SPN Ca”* activity as mice walked on the rotarod at a con- 
stant speed of 5 r.p.m. for 10 min. After dopamine depletion, we excluded from 
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analyses of SPN Ca*+ activity any mouse that could not stay on the rotarod for 
the full 10 min of Ca** imaging. 

Turning bias assay. To eliminate any potential effects on mouse turning behaviour 
from the miniature microscope or its cable, we studied the behaviour of a separate 
cohort of mice that underwent unilateral dopamine depletion. This allowed us 
to assay the dose-dependent behavioural effects of the same drugs that we stud- 
ied with in vivo Ca** imaging. We manually scored the number of rotations that 
mice made ipsilateral and contralateral to the right cerebral hemisphere during 
1h recordings of open field locomotion. We computed the rotational bias of each 
mouse as the difference between the total numbers of rotations in each direction. 
We then lesioned the right SNc with 6-OHDA and repeated this scoring procedure 
1 day and 14 days after the lesion. 

We then evaluated the effects of intraperitoneal injections of saline vehicle, 
quinpirole (1, 3 and 6mgkg~'), SKF81297 (1, 3 and 6mgkg~') and L-DOPA 
(1,6 and 10mgkg') during 3h of locomotion in the open field, on sequential days. 
We manually scored the rotational bias of each mouse off-line in 20-min time bins 
(Extended Data Fig. 9a—d). 

Dyskinesia assay. In the same mice and behavioural sessions used to quantify the 
turning bias, we measured the abnormal involuntary movement score (AIMS) for 
each mouse following established procedures*". This score is a well-known metric 
used to quantify dyskinesia levels in parkinsonian mice in response to a range of 
drug treatments and experimental manipulations*". In brief, for a period of 3h 
after systemic drug injection, every 20 min we assigned each mouse an AIMS 
value by observing the behaviour of the mouse for a sample period of 60s. We 
monitored the mouse for the presence of abnormal (i) axial, (ii) limb and (iii) 
orofacial movements. To provide multiple viewing perspectives of the mouse 
during AIMS scoring, we placed a pair of rectangular mirrors (30.5 cm x 45.7cm) 
outside the open field arena, at 90° to each other, behind the mouse and opposite 
to the experimenter. During each 60-s sample period, we assigned the mouse a 
score of 0-4 in each category (i-iii). A score of ‘0’ signified no observed abnor- 
malities. A score of ‘1’ or ‘2’ signified abnormal behaviour that respectively lasted 
<30s or >30s. A ‘3’ signified abnormal behaviour that lasted 60s but that could 
be interrupted by an external stimulus (for example, a loud clap). A “4 indicated 
abnormal behaviour that lasted 60s and could not be interrupted. For each 60-s 
sample period, we summed the individual category AIMS values to attain the 
total AIMS value. 

Histology. After all behavioural and Ca* imaging experiments, we euthanized 
and intracardially perfused the mice with phosphate-buffered saline (PBS) and 
then a 4% solution of paraformaldehyde in PBS. We sliced the fixed brain tissue 
using a vibratome (Leica VT1000s) to obtain 80-j1m-thick coronal sections. We 
immunostained the tissue sections with antibodies against tyrosine hydroxylase 
(1:500, Aves TYH) and GFP (1:1,000, Molecular Probes A-11122) and then applied 
fluorophore-conjugated secondary antibodies (Jackson Immunoresearch 703-586- 
155 and 711-546-152, respectively). We visualized immunofluorescence in our 
sections using a slide-scanning microscope (Olympus VS-120) and confirmed 
that all mice included for post-lesion analyses had a complete unilateral loss of 
dopamine neurons in the SNc. 

Video analysis of mouse behaviour. We used custom software written in 
MATLAB to analyse the videos (20-Hz frame acquisition rate) of mouse behaviour 
acquired during Ca”* imaging with the miniature microscope. First, we ranked all 
the pixels in each video frame according to their intensity values. Under the lighting 
conditions used to acquire the videos, the mouse appeared much darker than the 
surrounding open field arena (Supplementary Videos 1, 2). Therefore, we next 
determined the centroid location of the mouse in each frame as the mean location 
of the 1,500 darkest pixels. By visual inspection of each movie, we confirmed that 
this approach reliably tracked the movements of each mouse and yielded an accu- 
rate record of its position. To match the temporal resolution of our down-sampled 
Ca** recordings, we down-sampled the locomotor speed traces to 5 Hz via linear 
interpolation. We classified each 200-ms time bin as one in which the mouse was 
either ‘resting’ or ‘moving, according to the down-sampled speed trace and using 
a threshold value of 0.5cms ' to separate the two states. If the mouse was ‘moving’ 
in two time bins separated by <1, we classified the intervening time bins as ones 
in which the mouse was ‘moving. 

We applied these analytic procedures uniformly to the behavioural videos across 
all experimental and drug treatment conditions. Under dyskinetic conditions, mice 
commonly exhibit ongoing movements of the face, limbs and trunk. In general, 
these movements did not translate the body centroid sufficiently rapidly to register 
as locomotor episodes. Nevertheless, an important caveat is that under dyskinetic 
conditions a greater number of sub-threshold movements may have occurred 
during the computationally identified ‘resting’ periods than under normal or par- 
kinsonian conditions. 

Classification of movement types. To study and compare movements of different 
types made by freely moving mice (Extended Data Figs. 3, 8), we used custom 
software written in MATLAB (Mathworks) to extract from the behavioural videos 


the intervals from —4 to 4s surrounding the onset of each movement bout. Using 
this software, we visually scored each movement bout as an instance of forward 
locomotion, a left or right turn, grooming, or upward rearing. Additional details 
are shown in Extended Data Fig. 3. 

Quantification of movement in head-restrained mice. To analyse the locomotor 
behaviour of mice during two-photon Ca”* imaging sessions (see “Two-photon 
Ca** imaging in behaving mice’; Extended Data Fig. 4), we computed the running 
speed of the mouse using the signals from a rotary encoder on the running wheel 
and identified movement periods during which the running speed was above a 
minimum threshold. Additional details are shown in Extended Data Fig. 4. 
Basic processing of in vivo fluorescence Ca”* movies. To reduce processing 
times, we spatially down-sampled the raw Ca” videos from the miniature micro- 
scope using 4 x 4 bi-linear interpolation. To correct images for lateral motions 
of the brain, we used the image-alignment algorithm, TurboReg”, to perform 
a rigid image registration across all frames of the movie. To correct for fluctua- 
tions in background fluorescence intensity, we applied a Gaussian low-pass filter 
to each image, which smoothed all structures <100 1m in size, and then divided 
each image frame by its low-pass filtered version. Using the resultant normalized 
movie (F(t)), we subtracted the time-averaged, mean fluorescence intensity (Fo) 
of each pixel to obtain the difference movie, AF(t) = F(t)—Fo. We then divided 
this difference by Fo to obtain movies of relative fluorescence changes, AF(t)/Fo. 
Finally, we temporally down-sampled (4x) the AF(t)/Fo movies to a 5-Hz frame 
rate via a linear interpolation, and we used the resulting videos for extraction of 
individual neurons and their Ca?* activity traces. 

We processed the raw two-photon Ca?* videos with the same procedures, but 

with two exceptions: we did not down-sample the movies in space or time, and 
we applied a median spatial filter (6 x 6 pixels) to the two-photon AF(t)/Fy mov- 
ies before proceeding to cell sorting. For display purposes only (Supplementary 
Video 3), we converted the fluorescence trace, F(t), of each pixel into a trace of z 
scores, AF(t)/o. Here AF(t) = F(t)—Fo denotes the deviation of each pixel from 
its mean value, Fo, and o denotes the background noise of each pixel, which we 
estimated by taking the standard deviation values calculated with a sliding 250-s 
window. 
Identification of individual neurons. We identified individual cell bodies 
in the brain imaging videos by applying to the pre-processed one- and two- 
photon AF(t)/Fo movies an established algorithm for cell sorting based on prin- 
cipal component and independent component analyses (PCA-ICA)™. As in pre- 
vious work, the extracted spatial filters generally had sizes, morphologies and 
corresponding Ca” activity traces that were characteristic of individual neurons, 
but there were also some filters that were obviously not neurons that we discarded. 
We verified every cell included in the analyses by visual inspection. We did not 
register individual cells over multiple days; hence all analyses characterizing the 
rates, amplitudes and durations of Ca”* events (Extended Data Fig. 11-n) were 
conducted separately for each day of experimentation, over the uniform imaging 
duration (1h per session). 

Cells identified in the two-photon Ca* videos underwent further analyses, 
to classify each cell as being either an iSPN or dSPN, and to remove duplicate 
copies of individual cells found in more than one image plane. We first manually 
identified the centroids of all visible cells in each plane labelled with tdTomato. 
We then compared these centroid locations to those of the cells extracted from the 
two-photon Ca”* videos and matched each tdTomato-expressing cell to the nearest 
unassigned cell in the Ca”* video (Extended Data Fig. 4b, c). 

To remove duplicate copies of individual cells seen on more than one plane of 

the two-photon Ca”* videos, we computed for all nearby cells found on different 
planes both the distance between the centroids of the two cells and the Pearson 
correlation coefficient of their AF(t)/Fo activity traces. We identified duplicate 
copies of the same cell as cell pairs within 151m of each other and with an activity 
correlation coefficient >0.7. Among the duplicates, we retained the one with the 
Ca’* activity trace with the highest signal-to-noise ratio. 
Detection of Ca”* transients. After identifying individual cells in the one- and 
two-photon brain imaging datasets, we detected Ca”* events in the fluorescence 
trace of Ca?* activity of each cell using a threshold-crossing algorithm to detect 
fluorescence peaks that were >3 s.d. above the baseline signal value for >1.4s. We 
determined the time of each Ca”* event as the temporal midpoint between the 
time of the Ca** event's fluorescence peak and the most recent preceding trough 
in fluorescence”’. 

On 6.4+3% (mean +s.d.) of all detected Ca** events in the one-photon data- 
sets, bright fluorescence increases from a Ca** event in one cell occupied more 
pixels than a single spatial filter. To prevent contamination of the time traces of 
other cells we took a conservative approach and allowed only a single cell among 
a group of neighbours to register a Ca”* event in a single 200-ms time frame. We 
defined neighbours as cells that had centroids within 201m of one another”’. If 
multiple Ca”* events arose within 200 ms in neighbouring cells, we retained only 
the transient with the greatest peak value. 
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To characterize the waveforms of the Ca?* transients of the cells (Extended 

Data Figs. 11-n, 5f-h), we took the spatial filter of each cell, zeroed all pixels with 
values <50% of the peak value, and applied the truncated filter to the movie of 
AF(t)/Fo to generate a time trace of Ca?" activity (Fig. 1d). Using the previously 
identified Ca?+ events, we then considered the statistical distribution of Ca?* 
transient waveforms among all events and SPNs of a given type. From this distri- 
bution, we computed the median and interquartile range values of Ca”* transient 
amplitude and duration (Extended Data Fig. 11-n). With the exception of these 
basic characterizations of the Ca** transient waveforms, all subsequent analyses 
made use only of the occurrence times of the Ca”* transients and treated these 
occurrences as binary events. 
Determinations of coactivity levels in cell pairs. SPN ensembles displayed 
prominent temporal correlations between cells across timescales of ~1-2s 
(Supplementary Videos 1-4). To characterize correlations at these timescales, we 
first applied a ‘forward-smoothing’ by taking the binarized traces of Ca** events 
of individual cells and extending the number of time bins (200 ms per bin after 
temporal down-sampling; see ‘Basic processing of in vivo fluorescence Ca?* 
movies’) in which a cell was marked as active so as to include all frames within 1 s 
after the Ca”* event. Then, to assess the extent to which pairs of cells had correlated 
patterns of activity, for each cell pair we computed the Jaccard index characterizing 
the similarity between the binarized Ca”* activity traces of two cells. We used this 
index as a metric of pairwise coactivation and determined it as the total number of 
time bins during which both cells in a pair were active, divided by the total number 
of time bins in which either cell was active. We chose the Jaccard index because, 
unlike more common measures of correlation, such as the Pearson correlation 
coefficient, the Jaccard index does not depend on the number of time bins in 
which neither cell is active and in this sense is more sensitive to the periods of joint 
activity rather than those of joint inactivity. We computed the mean value of the 
Jaccard index as a function of the distance between the cells in a pair (Fig. 2d and 
Extended Data Figs. 4j, 6e). 

To control for changes in pairwise coactivation that might be simply owing to 
the changes in Ca”* event rates resulting from the experimental manipulations and 
drug treatments, we also computed the Jaccard index for shuffled datasets in which 
the Ca** events of each cell were randomly re-distributed in time. We performed 
1,000 independent shuffles of the Ca?+ event times for each cell, when mice were 
resting or in locomotion. To compare pairwise coactivation levels in the real data 
against those in the shuffled data, we normalized the plots of mean Jaccard index 
for the real data by the corresponding values for the shuffled data (Fig. 2d, e and 
Extended Data Figs. 4j, k, 6e). Alternatively, we subtracted the coactivity values 
determined from the shuffled datasets from those of the real data, and compared the 
resultant values across different experimental conditions for the proximal cell pairs 
(cell centroid separations between 20-100 1m lateral separation) (Figs. 3), 4f, 5g 
and Extended Data Fig. 7i, j). 

We also checked whether SPN activity rates varied across the imaging field- 
of-view and thus affected estimates of coactivity levels. We computed the Pearson 
correlation coefficient between the Ca”* event rate of every cell and its distance 
from the centre of the field-of-view. We found a very weak but statistically signifi- 
cant correlation between these two variables (r= 0.07 + 0.02; P< 10719; n= 29,097 
cells; n = 19 mice, each sampled during five imaging sessions), which explained 
almost none of the variations in the Ca* activity rates of the cells across the SPN 
population (R? = 0.005). Overall, effects arising from the locations of the cells in the 
field-of-view were far too weak to account for the substantial distance-dependence 
of pairwise coactivity levels of the cells (Fig. 2d). 

We determined pairwise coactivity levels in the two-photon Ca”* videos using 
identical procedures as for the one-photon Ca?* videos. 

Spatial coordination index. In the Ca** videos, there were plainly evident, spa- 
tially correlated patterns of SPN activation at the seconds time scale. To charac- 
terize the spatial structure of these patterns at individual time points in the videos, 
we computed an index of spatial coordination. To calculate how this index varied 
over an entire imaging session (Extended Data Fig. 2), we took the binarized 
traces of the Ca”* events of individual cells and extended the number of time bins 
(200 ms each after temporal down-sampling; see ‘Determinations of coactivity 
levels in cell pairs’) in which a cell was marked as active so as to include all frames 
within 1s after each Ca”* event. We performed this temporal expansion, which 
increased the time window over which cells appeared as jointly active, to highlight 
the spatially clustered activation of cells at the seconds time scale. For each time 
bin, we then computed the set of distances between all pairs of cells that were 
active in the time bin. For each time bin, we also calculated this set of pairwise 
distances for shuffled versions of the data, in which we randomized the identities 
of the active cells while maintaining their centroid coordinates as well as the same 
number of active cells in the time bin. We performed 1,000 independent shuffles 
for each time bin. Aggregating the results across all 1,000 shuffles, we computed 
the cumulative distribution function of pairwise distances between active cells. For 
each time bin, we also computed this cumulative distribution function for the real 
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data, and performed a pair of one-tailed Kolmogorov-Smirnoyv tests that compared 
the cumulative distribution functions of pairwise distances in the shuffled and 
real versions of the data. With one of the two tests, we examined the hypothesis 
that the active cells in the real data were distributed closer together than expected 
by chance, and with the other, we examined the hypothesis that the active cells 
in the real data were distributed further away from each other than expected by 
chance. We took the logarithm (base ten) of the lesser of the two P values (that is, 
the P value from the test that more significantly rejected its null hypothesis). We 
multiplied the resultant number by negative one or one, respectively, according to 
whether the locations of the active cells in the time bin were spatially correlated 
or anti-correlated (as determined by which of the two Kolmogorov-Smirnov tests 
gave the more significant result). We assigned the spatial coordination index (SCI) 
the value of the resulting product. 

To characterize the time-dependent relationship between mouse speed and the 

SCI, we computed the time-dependent cross-correlation function between the two 
variables (Extended Data Fig. 2e, f). To analyse the effects of the temporal sampling 
rate on these calculations, before computing the SCI, we first forward-smoothed 
(see previous paragraph) the Ca”* event rasters using a variety of different smooth- 
ing time constants from 200 to 1,000 ms. After computing the speed-SCI cross- 
correlation function, we normalized it to the mean of its values between the inter- 
vals from —4 to —3s and from 3 to 4s. 
Analyses of Ca?* event rates as a function of locomotor speed. To examine how 
the rate of Ca?*+ events varied with changes in locomotor speed (Figs. 2a, 3e, 4b-d, 5c 
and Extended Data Figs. 4g, 7d, e), we sorted the individual image frames of the 
behaviour video of each mouse according to the instantaneous locomotor speed 
of the mouse. We then binned the image frames into sets of images for which the 
locomotor speed was the same to within 0.036-0.73cm sy using finer bins at 
lower speeds and coarser bins at higher speeds. For each image set, we computed 
its mean rate of Ca? events across all detected SPNs. 

To make comparisons between mice (Fig. 2a), we took the mean Ca?* event rate 
of each mouse as a function of locomotor speed and normalized this curve to the 
mean Ca** event rate of each mouse while it was at rest (speed <0.5cm s~!). We 
averaged these normalized traces across the set of all Drdla™ or Adora2a™ mice. 

To compare Ca”* event rates during the various drug treatments to those fol- 
lowing injections of saline vehicle (Figs. 4b-d, 5c and Extended Data Fig. 7d, e), 
we normalized the Ca** event rates for each mouse and each drug condition by 
the mean rate when the mouse was at rest after having been given saline vehicle. 
For each drug condition, we then averaged these normalized curves across all 
mice in the cohort. 

To compare Ca”* event rates before and after dopamine depletion (Fig. 3e), we 
first computed the curve of the mean Ca’* event rate of each mouse as a function 
of locomotor speed. We averaged these curves across mice of a given genotype, and 
then normalized the resulting trace to the mean rate when the mice were at rest in 
baseline recordings, before 6-OHDA lesion. For all comparisons, we calculated the 
s.e.m. for the mean Ca”+ event rate for each mouse and speed bin. 
Determinations of motion onset- and offset-triggered averages. We identi- 
fied onsets of locomotion by finding all movement bouts (>2s in duration) that 
occurred >25 after the prior movement bout. We identified offsets of locomotion 
by finding all movement bouts that preceded the next movement bout by >2s. 
To calculate motion onset- or offset-triggered averages, for each mouse we first 
normalized the speed-dependent, mean rates of Ca”* events or mean spatial coor- 
dination indices to the corresponding mean values from when each mouse was 
at rest during the period of —2 to —1s relative to motion onset. We then took all 
motion onset and offset occurrences and calculated the time-dependent, onset- 
and offset-triggered averaged traces of the normalized Ca** event rate or spatial 
coordination index. We averaged these traces across all mice in each cohort. 

To calculate the average times at which dSPN and iSPN activity reached their 
half-maximum values relative to the onset of movement, we fit logistic curves to 
the mean trace of the Ca”* event rate of each mouse for the interval of —2 to 2s 
relative to movement onset. We then determined the half-maximum activation 
time from the fit parameters. We averaged these time values over all mice of the 
same genotype. 

To calculate the average times at which dSPN and iSPN activity commenced 
relative to the onsets of locomotion on the running wheel (Extended Data Fig. 4h), 
we calculated the percentage of cells of each type that were active in each time 
bin in the binarized traces of Ca** activity (see ‘Detection of Ca”* transients’). 
Across all instances of motion onset, we computed the average number of dSPNs 
and iSPNs that were active in each time bin over the interval of —3 to 1s relative 
to motion onset. We then fit these averaged traces to a logistic curve. From each 
parameterized fit curve, we determined the onset time as the time at which the 
mean percentage of active cells was >3 s.d. of the percentage of active cells during 
the baseline periods of —3 to —1s relative to motion onset. 

Analysis of Ca** event-triggered averages. To examine how the occurrences of 
the Ca?* events of cells (in the one- or two-photon imaging datasets) correlated in 
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time with changes in the locomotor speed of the mice (Figs. 2b, 3h and Extended 
Data Fig. 4i), we computed the time trace of the Ca” event-triggered, mean loco- 
motor speed for each cell. For each mouse, we averaged these event-triggered 
traces of mean speed across all cells and normalized the resulting trace by its value 
at either —2s or —20s relative to Ca?* event onset, in the one- and two-photon 
imaging studies, respectively. In experiments with freely moving mice, we averaged 
these traces across all mice with the same genotype. In dual-colour imaging experi- 
ments with head-fixed mice, we averaged the traces across all SPNs of the same 
type. We computed the first temporal moment of the resulting averaged trace across 
the interval of —2 to 2s, to determine the temporal offset between peak locomotor 
speed and the occurrence of an SPN Ca”* event. 

Similarities of the neural ensembles activated by movements of different types. 
To statistically evaluate the relative similarity of the neural ensembles activated 
during movement bouts of the same or different types, and during baseline periods, 
we used the Jaccard similarity index (Extended Data Figs. 3d, e, 8d—g). Additional 
details can be found in the Supplementary Methods. 

Decoding mouse locomotor speed based on SPN Ca”" activity. We used a gene- 
ralized linear model (GLM) to make time-dependent predictions of the locomotor 
speeds of mice based on the AF(t)/F activity traces of the cells as determined by 
two-photon Ca’* imaging. Additional details are shown in Extended Data Fig. 4. 
Computational simulations of SPN activity traces. To examine the temporal 
accuracy with which we could use Ca”* events in SPNs to determine the times 
of the accompanying action potentials, we created and analysed artificial datasets 
based on the fluorescence responses to action potentials observed in our dual elec- 
trical-optical recordings in vitro and on basic facets of the fluorescence recordings 
in vivo. Additional details can be found in the Supplementary Methods. 

Data analyses and statistical tests. We performed data analysis using custom 
software written in MATLAB and ImageJ. We also used the Prism (Graphpad) 
and R programming environments for statistical analyses. We used two-tailed, 
non-parametric statistical tests to avoid assumptions of normal distributions and 
equal variance across groups. All signed-rank tests were Wilcoxon signed-rank 
tests. All rank-sum tests were Wilcoxon rank-sum tests. Analyses were performed 
unblinded. The sizes of our mice samples were chosen to approximately match 
those of previous work, as there was no pre-specified effect size. In all box-and- 
whisker plots, horizontal lines denote median values, boxes cover the middle two 
quartiles, and whiskers span 1.5x the interquartile range. P and n values for the 


statistical tests performed in all the figures can be found in the Supplementary 
Information. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Code availability. The algorithm used for image registration is available on its 
author’s website”. The algorithm used for cell sorting is available as Supplementary 
Material published elsewhere*’. Other software code that support the findings of 
this study are available from the corresponding authors upon reasonable request. 
Data availability. The data that support the findings of this study are available 
from the corresponding authors upon reasonable request. 
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Extended Data Fig. 1 | See next page for caption. 
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Extended Data Fig. 1 | Characterizations of striatal SPN Ca”* activity 
patterns in live brain slices and recorded in behaving mice using a 
chronic microendoscopy preparation. a, We performed whole-cell patch- 
clamp recordings of SPNs in acute brain slices, using dual epifluorescence 
and infrared Nomarski (IR-DIC) microscopy to guide the recordings. We 
selectively recorded from fluorescent dSPNs and iSPNs in brain slices from 
Drdla™ and Adora2a™ mice that had been injected into the striatum with 
AAV2/9-CAG-FLEX-GCaMP6m-WPRE.SV40. The numerical aperture 
used for the fluorescence recordings was matched to that used for Ca?* 
imaging studies in freely behaving mice (Methods). b, Illustrative traces 
(top) of neural membrane potential (V,,,) showing the changes that result 
upon stepwise injections of electrical current (bottom). The traces exhibit 
typical waveforms for SPNs. c, To visualize Ca** transients evoked by 
different numbers and frequencies of action potentials in iSPNs and 
dSPNs in brain slice recordings, we elicited spiking with 1-nA pulses 
(1-ms duration; 1 to 10 pulses, delivered at 5, 10, 20, 50 and 100 Hz). Top, 
an example of a Vn trace during 1-10 pulses injected at 10 Hz. Middle, 
example traces of current pulse injections delivered at 10 Hz. Bottom, a 
representative trace of somatic Ca”* transients in response to the same 
10-Hz current injections. d, Example Ca** activity traces for each current 
injection pattern tested in brain slices. As expected, electrical stimulation 
at higher frequencies evoked larger amplitude Ca”* transients, with more 
sharply rising Ca** waveforms than those evoked at lower stimulation 
frequencies. Arrows mark current injections. e, f, Mean + s.e.m. values for 
Ca’* transient amplitudes (e) and area under the Ca* transient waveform 
(AUG; f) as a function of the number of action potentials evoked, for 
dSPNs and iSPNs in brain slices. Values for each cell are normalized to 
those evoked at the highest stimulation intensity (10 action potentials, 

100 Hz). Ca?* event amplitudes rose linearly with the number of action 
potentials (R? > 0.99 for 5-20 Hz; R? > 0.97 for all frequencies). There 
were no significant differences in event amplitude or AUC between dSPNs 
and iSPNs, at any stimulus intensity. n =9 dSPNs and 8 iSPNs; two-way, 
mixed-design ANOVA. Exact P values can be found in the Supplementary 
Information for all extended data figures. g, h, Mean + s.e.m. values 

for half-rise (g) and half-decay (h) times (t;/2), measured for the Ca?* 
events evoked in dSPNs and iSPNs in brain slices, plotted as a function 

of the number of action potentials. Consistent with their more prolonged 
stimulus durations, lower stimulation frequencies yielded greater 


increases in the transient rise time. Ca”* transient decay times were nearly 
independent of action potential frequency or number. i, Mean + s.e.m. 
values of the AUC of stimulus-evoked Ca”* events in dSPNs and iSPNs 

in brain slices, under control conditions or perfused with SKF81297 
(11M) or quinpirole (101M), respectively. Neither drug significantly 
affected somatic Ca”* event AUCs in either SPN type, regardless of 
stimulus intensity. n = 6 dSPNs and 6 iSPNs; two-way, repeated-measures 
ANOVA. The individual data points are plotted alongside the mean 
values. j, Two example time-lapse fluorescence image series, acquired over 
the course of experiments in a Drdla‘” mouse (top) and an Adora2a“” 
mouse (bottom). Scale bars are 125 1m and apply uniformly to all panels 
within each row. Insets show example Ca** transients in the absence 

and presence of dopamine receptor agonist SKF81297 or quinpirole. 

Scale bars are uniformly sized for all insets. k, Surgical implantation of 
the microendoscope did not affect the ability of mice to improve their 
motor performance on the accelerating rotarod assay, as shown by their 
increased latencies to fall off the rod, which parallel the performance 
improvements of control mice without an implanted microendoscope. 
P=0.8; two-way, repeated-measures ANOVA; n = 13 control and 20 
implanted mice. Inset, Microendoscope implantation did not alter levels of 
spontaneous movement in an open field arena. P=0.7; n=5 control and 
6 implanted mice; Wilcoxon rank-sum test. Data points from individual 
mice are shown as open circles. l-n, Box-and-whisker plots of the rates 

(1), amplitudes (m) and full-width at half-maximum (FWHM) durations 
(n) of Ca?* events observed in live normal (untreated) mice in individual 
dSPNs and iSPNs, before (Pre-lesion) and after (Post 14-d) dopaminergic 
lesions, as the mice were resting (left) or moving (right). In resting mice, 
Ca** event rates decreased in dSPNs and increased in iSPNs after the 
lesion (1) as characterized in Fig. 3. When the mice were moving, Ca”* 
event rates in iSPNs were similar before and >14 days after the lesion, 
whereas the rates in dSPNs were depressed after the lesion. Each box-and- 
whisker plot is based on n = 3,325-3,703 dSPNs or iSPNs, from 12 Drdla‘® 
and 13 Adora2a™ mice, respectively, tracked before or 14 days after the 
6-OHDA lesion. Horizontal lines denote median values, boxes cover the 
middle two quartiles and whiskers span 1.5 x the interquartile range. Data 
denoted as ‘Pre-lesion’ were taken on day —5. Data denoted as ‘Post 14-@ 
were taken on day 14. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | The spatial coordination index is a metric of 
the extent to which the activated neurons at an individual time point 
are spatially clustered. a, Outline of the algorithm (Methods) used 

to compute the SCI at individual time points in the fluorescence Ca”* 
movies. For each time point in a AF(t)/Fy movie, we created an image 

of all cells that were coactive within 1 s of each other. A schematic of an 
example image (left) has a mixture of active (red) and inactive (white) 
cells. We computed the set of distances between all pairs of active cells in 
the image (middle). We then compared these values to the set of pairwise 
distances between active neurons in shuffled versions of the data from 
the same image, in which the identities of all the individual cells were 
randomly permuted while maintaining the same set of cell centroid 
locations. The shuffled dataset as a whole comprised 1,000 different 
random shuffles. We compared the cumulative probability distribution 
function (CDF) of all pairwise distances between active cells in the real 
image, to that for the shuffled data (right). We then tested two hypotheses, 
that the activated cells in the real image were either more spatially 
clustered than expected from a random pattern of activation, or less 
spatially clustered than expected from random activation. To test these 
two hypotheses, we performed a pair of one-sided Kolmogorov-Smirnov 
tests, comparing the real CDF values to those of the shuffled data. We 
determined the SCI by selecting the smaller of the two P values from 
these two tests, taking its logarithm, and assigning the sign of the index 
according to whether the CDF of the real data showed greater (positive 
sign) or less (negative sign) spatial clustering than the CDF of the shuffled 
data. b, Depictions of three images with different SCI values. The left 
image has an SCI >0, due to its many neighbouring coactive cells. The 
middle image has an SCI near zero, implying the activation pattern is 
consistent with that of a random distribution. The right frame has an SCI 
<0, indicating the active cells are further apart than generally expected of 


random activity. c, Examples of three image frames of different SCI values, 
from an actual Ca** movie acquired in a Drdla‘“ mouse. Active cells are 
shown enlarged and shaded white, whereas inactive cells are shaded grey. 
On the left is an image with multiple neighbouring coactive neurons and 

a high SCI. The middle image has a modest but positive SCI. The right 
image has a negative SCI, indicating that the activated cells are further 
apart than expected given a random pattern of activation. d, Distributions 
of pairwise distances between all pairs of active dSPNs for the three 
corresponding image frames shown in c. The distributions for the real 
data are shown in red, whereas those for the shuffled datasets are shown in 
grey. e, Mean time-dependent cross-correlation functions between mouse 
speed and the SCI of Ca’* activity in dSPNs (left) and iSPNs (right). 
Cross-correlation values are shown normalized to their values for abscissa 
values of +4 (Methods). We computed the SCI from Ca?* event traces 

to which we applied forward-smoothing, using one of five different filter 
values (coloured curves; Methods). Error bars are omitted for clarity in e, 
but are comparable to those shown in f. The centre-of-mass of the cross- 
correlation functions occurred at positive time lags (around 12-60 ms), 
indicating that rises in spatially coordinated Ca’* activity preceded 
increases in mouse speed (P < 0.06 for both genotypes and all filter time 
constants; Wilcoxon signed-rank test), but there were no significant 
differences across the two mouse lines (P > 0.05 for all filter time 
constants; Wilcoxon rank-sum test; n = 37 cross-correlation functions 
from 16 Drdla“ mice and n= 52 from 21 Adora2a™ mice). f, Mean cross- 
correlation functions between mouse speed and the SCI of Ca’* activity 
in dSPNs and iSPNs computed from Ca** event rasters forward-smoothed 
with a 1,000-ms time constant. Shading indicates s.e.m. e, f, Values taken 
from n= 16 Drdla“ mice and n= 21 Adora2a™ mice, aggregated over the 
1-h recordings on day —5 and the 30-min recordings performed on days 
—4 to —1 after saline vehicle injection but before drug administration. 
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Extended Data Fig. 3 | See next page for caption. 
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Extended Data Fig. 3 | Different types of movements recruit distinct 
but overlapping populations of spatially proximal dSPNs and iSPNs. 
a, To study and compare movements of different types made by freely 
moving mice, we used custom software to extract from the behavioural 
videos the intervals from —4 to 4s surrounding the onset of each 
movement bout. Using this software, we manually labelled each bout 

as an instance of forward locomotion, a left or right turn, grooming or 
upward rearing. If the mouse made multiple types of movement within 
an individual bout, we labelled the bout according to the first movement 
type exhibited, as only the interval from —1 to 2s relative to motion onset 
was used in subsequent analyses of the accompanying neural Ca? activity. 
b, The fraction (mean +s.e.m.) of SPNs that exhibited Ca?* activity, 
relative to the baseline periods immediately preceding each movement 
type. Relative to baseline (dashed line), there was a significant increase in 
the fraction of dSPNs and iSPNs activated for all movement types except 
grooming. **P<5 x 10-3 and ***P<5 x 10-8 for dSPNs; ##P < 10-4 
and ###P < 10~!° for iSPNs; Wilcoxon signed-rank test. Data in b and 
care from n= 492 forward movements, 657 right turns, 810 left turns, 
732 grooming and 204 rearing bouts from 17 Drdla“* mice, and n= 790 
forward, 785 right turns, 1,015 left turns, 792 grooming and 164 rearing 
bouts from 21 Adora2a mice. c, Rates (mean +s.e.m.) of Ca?* events 
in dSPNs and iSPNs, plotted as a function of time relative to the onsets 
of different types of movements. Event rates are shown normalized to 
the values of —2 to —1s before motion onset and rose significantly above 
baseline values during all types of movement in both cell types. P< 10~® 
for both cell types and all movement types; Wilcoxon signed-rank test. 
d, Mean values of the neural ensemble similarity computed for the sets 
of dSPNs and iSPNs that were active during pairs of bouts of either the 
same (on-diagonal) or different (off-diagonal) types of movements 
(Methods). For all movement types, the similarities of the cell ensembles 
that were active on different bouts of the same movement type were 
significantly greater than those of the ensembles that were active during 
the baseline periods before each bout. #P < 0.05; Kolmogorov-Smirnov 
test, corrected for multiple comparisons using a Benjamini-Hochberg 
procedure with a false-discovery rate of 0.05. Off-diagonal asterisks 
indicate that the neural ensembles that were active during bouts of two 


different movement types were significantly less similar to each other 
than the ensembles activated on different bouts of the same movement 
type, for both of the two movement types under consideration. *P < 0.05; 
Kolmogorov—Smirnov test, corrected for multiple comparisons using a 
Benjamini—-Hochberg procedure with a false-discovery rate of 0.05. e, The 
cumulative distribution functions show the range of ensemble similarity 
values for the dSPN and iSPN ensembles that were activated on two 

bouts of the same movement type or on two bouts of different movement 
types (as described in d). For both iSPNs and dSPNs, ensemble similarity 
values were significantly lower for two bouts of different movement 

types than for two different bouts of the same movement type. P< 0.01; 
Kolmogorov-Smirnov test. f, Mean pairwise distances between the 
individual dSPNs or iSPNs activated on bouts of two different movement 
types. We normalized the values by comparing the mean weighted distance 
between the cells that were active during the two movement types to 

the same quantity determined under the null hypothesis that the spatial 
probability distributions of active cells on the two movement types were 
the same. For the latter determination, we created shuffled datasets in 
which we randomly permuted the firing rate of each cell between the two 
movement types, and we averaged the results over twenty-five different 
shuffled datasets. We normalized each distance value by taking the actual 
mean value, subtracting the mean value determined under the null 
hypothesis, and then dividing this difference by the standard deviation of 
the distance across the twenty-five shuffled datasets (Methods). Asterisks 
indicate that the mean pairwise distance was significantly greater than 
that expected by chance, indicating that the two movement types activated 
spatially distinguishable cell ensembles. *P < 0.05; Wilcoxon signed-rank 
test, corrected for multiple comparisons using a Benjamini-Hochberg 
procedure with a false-discovery rate of 0.05. Plots in d-f are based on 
n=102 and n= 126 comparisons between bouts of the same movement 
type, and 255 and 315 comparisons of bouts of different movement 

types, in Drdla‘ and Adora2a™ mice, respectively. Values are from 
n=17 Drdla™ mice and n=21 Adora2a™ mice, aggregated over the 1-h 
recordings on day —5 and the 30-min recordings performed on days —4 to 
—1 after saline vehicle injection but before drug administration. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | Simultaneous recordings of dSPNs and iSPNs 
show that the two cell types encode movement with indistinguishable 
spatiotemporal patterns of activation. a, To image dSPN and iSPN Ca?* 
activity concurrently, we prepared mice that expressed GCaMP6m in 
both cell types but tdTomato only in dSPNs (Methods). Mice were head- 
fixed on a running wheel beneath the objective lens of a two-photon 
microscope. A piezoelectric actuator moved the axial position of the 
objective to allow volumetric imaging across four planes at different 
depths of the tissue (15-\1m axial spacing). The mice were free to run or 
rest on the wheel (Supplementary Video 3). We tracked the motion of the 
wheel using a rotary encoder (500 encoder pulses per revolution) that 
provided a read-out of the instantaneous locomotor speed. We computed 
the mean speed of the mouse at a time-resolution matching that of the 
two-photon volume acquisition rate (6 Hz, or 166 ms per time bin). We 
identified periods of movement by marking all time bins in the mean 
speed trace with values >0.2 cms '. To identify instances of motion 
onset, we selected all time bins for which speeds were <0.2 cms! for at 
least 1 s in the immediately prior time bins, and >0.2 cm s~! for at least 

1s in the immediately subsequent time bins. To identify instances of 
motion offset, we used the opposite criterion. b, Histological section of 
dorsomedial striatum expressing tdTomato (red) in Drd1la“” positive cells 
and immunostained for GFP (green) to visualize GCaMP6m expression. 
Closed arrowheads point to three example dSPNs that expressed both 
GCaMP6m and tdTomato. Open arrowheads point to three putative 
iSPNs that expressed GCaMP6m but not tdTomato. Scale bar, 501m. 

c, Representative cell maps from each of the four imaging planes in an 
example mouse with 118 detected dSPNs (blue) and 183 detected iSPNs 
(red). Scale bars, 100m. d, Representative traces of Ca?* activity from 
10 dSPNs (blue) and 10 iSPNs (red) from the same mouse as in c. Grey 
shading here and in e, g, h denotes periods classified as movement on the 
running wheel. e, Locomotor speed on the running wheel (top) and Ca?* 
activity traces of individual dSPNs (middle) and iSPNs (bottom), during 
part of an imaging session in an example mouse. Note the clear correlation 
between locomotion and Ca?" activity in both cell types. f, Mean 
cumulative distribution functions of Ca”* event rates in dSPNs (n = 699) 
and iSPNs (m= 1020) were nearly identical, during periods of rest (left) 
and running (right). g, Mean Ca** event rates in dSPNs and iSPNs as a 
function of mouse locomotor speed. Events rates are shown normalized 
to their mean levels when the mice were resting (speed <0.2cms™!). 
Grey shading denotes speeds at which we classified the mouse as moving 
(>0.2cms~!). h, Mean locomotor speed (bottom) and the fraction of 
SPNs that are activated (top), plotted as a function of time relative to 
motion onset (left) and offset (right). We determined the onset time of 
neural activity as the time at which the mean percentage of active cells was 


>3s.d. of the percentage of active cells during the baseline periods of —3 
to —1s relative to motion onset. Using this criterion, dSPNs and iSPNs 
respectively activated —666 + 97 ms and —733 + 51 ms before motion 
onset (mean + s.e.m; n =5 mice; Methods), which were statistically 
indistinguishable. P= 0.9; Wilcoxon rank-sum test. i, Mean locomotor 
speed as a function of time relative to the occurrences of Ca* events in 
dSPNs and iSPNs. For each cell, we normalized these traces to the mean 
speed 20s before Ca?* excitation, then averaged the traces across all cells 
of each type and all mice. j, Mean pairwise coactivity (Methods) for dSPN- 
dSPN, iSPN-iSPN and dSPN-iSPN cell pairs, computed as a function of 
the distances between the pairs of cells, normalized to coactivity values 

in temporally shuffled datasets (1,000 distinct shuffles) in which time- 
correlated activity patterns were scrambled. Cyan shading indicates 
proximal (20-100 1m) cell pairs. Data are from periods of movement on 
the running wheel. k, Pairwise coactivity values (mean + s.e.m.) were 
significantly greater for proximal cell pairs than those in temporally 
shuffled datasets (*P <0.05; Wilcoxon rank-sum test; n =5 mice), and did 
not depend on the SPN types. n.s. denotes P > 0.05; Wilcoxon rank-sum 
test. Data points from individual mice are shown as open circles. 1, We 
created GLMs to make time-dependent predictions of mouse locomotor 
speed based on the AF(t)/Fp activity traces of the cells determined by 
two-photon Ca** imaging. We used the GLM libraries in MATLAB 
(Mathworks), using a Gaussian noise model and taking the identity 
function as the linking function. We used 70% of the time bins in the set 
of AF(t)/Fo traces for training the GLM, and 30% for testing it. To study 
the accuracy of the speed predictions as a function of the number of cells 
(n) included in the GLM, for each value of n, we randomly chose n cells 
from the total available, constructed the GLM and computed the Pearson’s 
correlation coefficient between the actual and predicted speed traces. 

For each n value, we repeated this procedure with 10 different randomly 
chosen subsets of cells and then computed the mean correlation coefficient 
of the real and predicted speeds across the 10 different sub-samplings. In 
an example mouse, traces of the actual running speed (grey) were well 

fit by models based on the activity of either dSPNs (blue) or iSPNs (red). 
m, Mean Pearson correlation coefficients for an example mouse between 
the actual and predicted running speeds, for GLMs based on either dSPNs 
or iSPNs (10 sets of randomly chosen cells for each abscissa value, out of 
301 total cells). Inset shows the correlation coefficients (mean + s.e.m.) 
from n=5 mice, computed using either 115 dPSNs or 115 iSPNs from 
each mouse. Data points from individual mice are shown as open circles 
and were statistically indistinguishable across the two cell types. P= 0.13; 
Wilcoxon signed-rank test. f-j, Colour shading denotes s.e.m. for n=5 
Drdla™ x Ail4 mice. 
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Extended Data Fig. 5 | See next page for caption. 
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Extended Data Fig. 5 | 6-OHDA injections rapidly ablate SNc 
dopamine neurons and disrupt motor behaviour without impairing 
the fidelity of Ca”+ event detection. a, Example coronal brain section 
from an experimental mouse, immunostained for GCaMP6m (anti- 

GFP, green) and tyrosine hydroxylase (anti-TH, red). DAPI was used 

to stain the nuclei (blue). White lines demarcate the position of the 
implanted microendoscope, and the boundaries of nearby brain areas. 
AcbC, accumbens core; Cg, cingulate cortex; CPu, caudate/putamen; 

IC, insular cortex; M1 and M2, motor cortices; Pir, piriform cortex; $1 
and S2, somatosensory cortices. Labels are adapted from an anatomical 
atlas of the mouse brain*®. Scale bar, 1 mm. b, Representative midbrain 
coronal sections acquired 1 day after unilateral infusions of saline (top) 

or 6-OHDA (middle), or >14 days after 6-OHDA infusion (bottom), and 
then immunostained for tyrosine hydroxylase (red). DAPI was used to 
stain the nuclei (blue). Dopamine cell bodies are absent in the SNc of mice 
that received 6-OHDA, at both 1 and >14 days after 6-OHDA infusion. 
Scale bar, 500m. c, 24h after infusions into SNc of 6-OHDA but not of 
saline, mice exhibited disrupted patterns of spontaneous locomotion in 

an open field arena. P=4 x 107? for 6-OHDA and P=0.8 for saline; n =7 
saline-treated and 14 6-OHDA-treated mice; Wilcoxon signed-rank test 
for comparisons to the pre-lesion behaviour of each mice. d, The median 
fluorescence intensity across the entire imaging field (normalized to pre- 
lesion values on day —5 for each mouse and then averaged across mice) 
decreased significantly after 6-OHDA lesions in mice of both genotypes. 
P<10~’ comparing median fluorescence intensities (normalized to pre- 
lesion means) averaged across 5 days before versus 5 days after lesion; 
Wilcoxon rank-sum test. However, fluorescence intensities stabilized 
15-17 days after the 6-OHDA lesion, and there were no further significant 
changes over time in either mouse line. P> 0.05; n=5 Drdla™ and 

7 Adora2a™ mice; Spearman correlation. Error bars indicate s.e.m. 

for 5 Drdla‘”’ and 7 Adora2a™ mice. e, The number of active SPNs 

(mean + s.e.m.) detected in total on each day of the study, normalized to 
the mean value (dashed horizontal line) detected before 6-OHDA infusion 
(days shaded grey). On days on which mice received drug treatments 

(Fig. 1a), the data shown here are from the initial portions of the recording 
sessions before drug administration. The number of active cells was stable 
across the study, except for a single pairwise difference in the number of 
active iSPNs. Friedman ANOVA; n=5 Drdla‘’ and n=7 Adora2a™ mice; 
P>0.05 for Drdla‘ and P=0.01 for Adora2a‘; P=5 x 10~* for post 
hoc test comparing the number of iSPNs detected 1 day before the lesion 


and 15 days after the lesion; Fisher’s least significant difference test with 

a Holm-Bonferroni correction for multiple comparisons. f, Cumulative 
distributions of peak AF/F values for Ca?* events from individual 

dSPNs (top) and iSPNs (bottom), before and after 6-OHDA lesion. After 
the lesion, Ca** event amplitudes were significantly greater in dSPNs 

(3.7 £ 0.02% (pre-lesion) versus 6.0 + 0.03% (14 days after)), but smaller in 
iSPNs (4.4 + 0.02% (pre-lesion) versus 4.0 + 0.02% (14 days after)). Data 
are mean +s.e.m.; P< 107!” for both SPN types; Wilcoxon rank-sum test. 
Data in f and g are from n = 3,332-3,734 dSPNs or iSPNs, from before 
(day —5) or 14 days (day 14) after 6-OHDA lesion, in n= 12 Drdla™ mice 
and 13 Adora2a™ mice, respectively. g, Cumulative distributions of the 
signal detection fidelity (d’) of Ca”* events in individual dSPNs (top) and 
iSPNs (bottom), before and after 6-OHDA lesion. Owing to the decrease 
in background fluorescence intensity, Ca?* events in dSPNs became easier 
to detect after 6-OHDA lesion, as quantified by the increase in d’ values 
for dSPNs. d’ =17+0.1 (mean +s.e.m.; pre-lesion) versus 29 + 0.2 (14 
days after). The changes in d’ values for iSPNs were smaller in magnitude 
(d’=22+0.1 (pre-lesion) versus 19 + 0.1 (14 days after)), while keeping 
the optical conditions for Ca”* event detection extremely favourable. 
Crucially, all changes in Ca** event detection fidelity values were opposite 
to those observed for Ca?* event rates in the two cell types (Fig. 3), and 
thus cannot account for the event rate changes. h, d’ values for Ca?* 
events from individual dSPNs (left) and iSPNs (right), before and after 
6-OHDA lesions, and after drug treatments in mice following 6-OHDA 
lesion. d' values increased after 6-OHDA lesions and decreased after the 
drug treatments in dSPNs. By contrast, d’ values decreased after 6-OHDA 
lesions and increased after drug treatment in iSPNs. ***P < 107" for all 
conditions in both genotypes; Wilcoxon rank-sum test; values are from 
n= 1,770-2,027 dSPNs from 5 Drd1la“ mice and n = 1,719-2,393 iSPNs 
from 6 Adora2a“° mice, recorded before (day —5) and after 6-OHDA 
lesion (day 14), or after the lesion and quinpirole (day 16), SKF81297 

(day 18) and t-DOPA (day 20) treatments. As in g, these changes in 

d’ have an opposite sign to the changes in Ca’* event rates (Fig. 4), 

and thus cannot account for the latter effects. Throughout the study, 

d' values remained extremely high, in that a d’ value >16 corresponds 
mathematically to a mean rate of <10~!” errors in Ca** event detection 
per hour. This nearly vanishing predicted error rate is unattainable 
experimentally over many hours of recording but underscores the highly 
favourable conditions for Ca** imaging. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | Lesion of dopamine neurons altered Ca”* event 
rates in dSPNs and iSPNs during spontaneous open field exploration 
and forced movement on the rotarod. a, b, Cumulative probability 
distributions of Ca”* event rates in dSPNs (left) and iSPNs (right) while 
mice were resting (a) or in locomotion (b) before the lesion of dopamine 
neurons, | day after the lesion (day 1), and >14 day after the lesion (day 14). 
Ca?* event rates in dSPNs were depressed at both time points after 

the lesion, during rest and locomotion. Ca*+ event rates in iSPNs were 
elevated at both time points after the lesion when mice were resting. 
However, when the mice were moving, iSPN activity rates were elevated at 
1 day but not >14 days after the lesion, compared to values from before the 
lesion. The distributions of Ca’* event rates in dSPNs in resting mice were 
nearly identical at 1 and >14 days, making it hard to visually distinguish 
the two plots. Data are from 12 Drdla™ mice and 13 Adora2a™ mice. 

The Drdla™ mice yielded a total of 2,554-3,732 activated dSPNs during 
movement and rest, across the different days of the study, whereas the 
Adora2a™ mice yielded 3,209-3,702 iSPNs. All comparisons within each 
SPN type between Ca** event rates from before the lesion to those 1 d 

and 14 d after the lesion were significant. P< 1073; Wilcoxon rank-sum 
test. c, Cumulative distribution functions of Ca?* event rates in individual 
dSPNs and iSPNs as mice walked on the rotarod. At 1 day after dopamine 
depletion (day 1), Ca?* event rates were reduced in dSPNs and increased 
in iSPNs. At >14 days after dopamine depletion (day 14), Ca”* event rates 
were still reduced in dSPNs but in iSPNs were at or even slightly below 
normal values. Data pre-lesion and >14 days after dopamine depletion 
are based on 5 Drdla mice and 7 Adora2a‘™ mice; however, | day after 
dopamine depletion only 2 Drd1a“* mice and 3 Adora2a™ mice could 
perform the rotarod assay without falling off, precluding determinations of 


Ca?* event rates from the other mice on that day. All comparisons within 
each SPN type between Ca’* event rates from before the lesion to those 1 
and >14 days after the lesion were highly significant (P< 10~'°; Wilcoxon 
rank-sum test), except for the comparison of iSPN activity rates from 
before the lesion to those >14 days after (P=5 x 107; Wilcoxon rank- 
sum test). d, Rates of Ca?* events (mean +s.e.m.) as mice walked on the 
rotarod, normalized to the corresponding values from before dopamine 
depletion. dSPN activation rates were substantially reduced at both time 
points after the lesion. iSPN event rates were substantially increased 

1 day after the lesion (day 1), but had returned to near baseline levels 

by 14 days after the lesion (day 14), mirroring the effects of dopamine 
depletion on dSPN and iSPN activity during spontaneous movement (see 
Fig. 3f). **P<5 x 10-3 and ***P <5 x 1077; Wilcoxon rank-sum test. 
Owing to the large numbers of Ca** events across thousands of SPNs 

that contributed to these calculations, the slight depression of Ca?* event 
rates in iSPNs at 14 days after the lesion was statistically significant. e, The 
extent of pairwise coactivation of dSPNs (left) and iSPNs (right) during 
the rotarod assay, as a function of the pairwise distance between cells, 
before dopamine depletion, at 1 and >14 days after dopamine depletion. 
Values are normalized to those attained from shuffled datasets in which 
the times of the Ca”* events of each cell were randomized (Methods). The 
spatial clustering of activity in proximal iSPN pairs (20-100 ym apart) 
was significantly reduced 14 days after the lesion, relative to values from 
beforehand. P< 107!*; Wilcoxon rank-sum test; n =7 Adora2a“ mice. 
Shaded areas denote s.e.m. Data and statistical tests in c-e are from 1,930, 
712 and 1,672 dSPNs and 1,731, 853 and 1,607 iSPNs that were active 
during the rotarod assay pre-lesion, 1 (day 1), and 14 (day 14) days after 
the lesion, respectively. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | In healthy mice, D1R or D2R antagonists affect 
SPN activity similarly to an acute (but not chronic) loss of dopamine 
neurons, whereas dopamine receptor agonists have divergent effects 
from those observed in dopamine-depleted mice. a—c, Example 
locomotor trajectories (15-min duration) of mice moving freely within 

a circular arena (a) and paired example traces of mouse locomotor 

speed and the mean rate of Ca”* events in dSPNs (b) and iSPNs (c) 

after administration of saline vehicle or a selective antagonist of D1Rs 
(SCH23390, 0.2 mg kg~') or D2Rs (raclopride, 1 mg kg™'). Both drugs 
reduced locomotion relative to vehicle. P= 10~4; Wilcoxon signed-rank 
test; n= 18 mice. d, e, Mean Ca?* event rates in dSPNs (d) and iSPNs 

(e) as a function of locomotor speed, following administration of saline 
vehicle, a D1R- or D2R-selective antagonist (0.2 mgkg' SCH23390 

or 1mgkg raclopride, respectively) (top) or a D1IR- or D2R-selective 
agonist (1 mgkg~' SKF81297 or 1 mgkg~! quinpirole, respectively) 
(bottom). Rates are normalized to cell population means when mice were 
at rest (<0.5cms~') after vehicle injection (Methods). Colour shading 
surrounding each curve denotes s.e.m. f, SPN Ca”* event rates during 
rest and movement after treatment with a dopamine-receptor antagonist 
or agonist. The DIR antagonist (SCH23390) reduced dSPN but not 

iSPN activity. The D2R antagonist (raclopride) increased iSPN but not 
dSPN activity. Both agonists (quinpirole, SKF81297) reduced activity in 
both SPN types during rest and movement. *P < 0.05, **P<5 x 1073, 
#4 P <5 x 10-7; Wilcoxon signed-rank test comparing all drug conditions 
to saline vehicle injection; n = 14 speed bins per mouse in the resting 
state (<0.5cms_') and 24 speed bins per mouse in the moving state 
(>0.5cms~!);n=7 Drdla™ and 11 Adora2a‘ mice. g, h, Mean Cat 
event rates in dSPNs (g) and iSPNs (h) relative to motion onset (left) 


and offset (right) following administration of saline vehicle, SCH23390 
or raclopride (top), or SKF81297 or quinpirole (bottom). Traces are 
normalized to cell population means averaged over the time period 

from —2 to —1s preceding motion onset. Raclopride increased dSPN 
(P=6 x 10-1!) and iSPN (P=2 x 107”) activation at motion onset. 
Wilcoxon rank-sum test; comparing 11 time bins for the interval of 0-2s 
after motion onset to the baseline periods in each of the 7 Drdla’ and 

11 Adora2a™ mice. SCH23390 slightly altered dSPN (P= 0.02) but not 
iSPN activation (P= 0.4) at motion onset. In Drdla‘” mice, there were 
n=1,917 (vehicle), 134 (SCH23390) and 129 (raclopride) instances of 
motion onset, whereas in Adora2a™ mice there were 2,798 (vehicle), 

155 (SCH23390) and 233 (raclopride) such instances. Colour shading 
surrounding each curve denotes s.e.m. i, j, Coactivity of proximal 
(20-100 m apart) dSPN (i) or iSPN (j) cell pairs during periods of 
movement, following administration of saline vehicle, SCH23390 or 
raclopride (top) or saline, SKF81297 or quinpirole (bottom). Coactivity 
values are plotted after subtraction of the coactivity values determined for 
temporally shuffled datasets under the same drug treatment conditions, 
and then normalized to the values attained for saline administration. 
Comparing measured values for each drug to those attained with saline; 
*P<0.05, **P <5 x 10-4 and ***P <5 x 1077; Wilcoxon signed-rank 
test; 8 spatial bins of cell-cell separation for each of the n=7 Drdla‘” and 
n=11 Adora2a™ mice. d-j, Data are based on the same 7 Drdla™ and 11 
Adora2a™ mice. Figure 1a shows the schedule of drug treatments. Data 
acquired in the 30-min Ca”* imaging sessions after drug administration 
have been normalized for each mouse to the values determined on the 
same day during the 30-min Ca* imaging sessions occurring after saline 
vehicle administration but before drug treatment. 
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Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | Dopamine depletion reduces the movement- 
type specificity of iSPN activity patterns. a, Following dopamine 
depletion, we classified mouse movement bouts into different types, as 
in Extended Data Fig. 3. After 6-OHDA lesion, there were insufficient 
bouts of upward rearing to allow statistical analyses. b, Percentages of 
SPNs (mean +s.e.m.) that exhibited Ca** activity, relative to the baseline 
periods (dashed line) immediately before each movement type, >14 
days after 6-OHDA lesion. Compared to baseline periods, there was a 
significant increase in dSPN and iSPN activity for all movement types, 
except during grooming. *P < 0.05 and **P <5 x 10? for dSPNs; 

#P <0.05 and ##P <5 x 107? for iSPNs; Wilcoxon signed-rank test. Data 
in b and c are based on 206 forward movements, 200 right turns, 290 

left turns and 314 grooming bouts in 13 Drdla™ mice and 331 forward 
movements, 295 right turns, 194 left turns and 339 grooming bouts in 

21 Adora2a mice. c, Rates of Ca2* events (mean +s.e.m.) in dSPNs 
and iSPNs as a function of time relative to the onsets of different types 

of movements, >14 days after 6-OHDA lesion. Event rates are shown 
normalized to the values from —2 to —1s before motion onset. As found 
when all movements were grouped together (Fig. 3g), the rates of iSPNs 
hardly increase at motion onset in parkinsonian mice. d, Mean values of 
the neural ensemble similarity computed for the sets of dSPNs and iSPNs 
that were active during pairs of bouts of either the same (on-diagonal) or 
different (off-diagonal) types of movements, in freely behaving mice >14 
days after 6-OHDA lesion (Methods). For each pair of movement types, 
values are shown normalized to the corresponding value found in healthy 
mice before 6-OHDA lesion. Asterisks indicate a significant difference 
between the pre- and post-lesion values. *P < 0.05; Wilcoxon rank-sum 
test, corrected for multiple comparisons using a Benjamini-Hochberg 
procedure with a false-discovery rate of 0.05. e, Cumulative distribution 
functions showing the range of ensemble similarity values for the sets 

of dSPNs and iSPNs that activated on two bouts of different movement 


types. The similarity of the dSPN ensembles activated on different 
movement types decreased significantly after 6-OHDA, indicating that the 
representations of different movements became more distinct. P < 0.05; 
Kolmogorov—Smirnov test. By comparison, the isPN ensembles activated 
on different types of movements became more similar after 6-OHDA 
lesion, consistent with a reduction in the selectivity of iSPN movement 
encoding. P< 0.05; Kolmogorov-Smirnov test. f, Mean pairwise distances 
between the individual dSPNs or iSPNs activated on bouts of two different 
movement types, in freely behaving mice >14 days after 6-OHDA lesion. 
We expressed the values as z scores, as described in Extended Data Fig. 3f. 
For each pair of movement types, we then normalized each z score by 

the corresponding value found in healthy mice before 6-OHDA lesion. 

g, Cumulative distribution functions showing the normalized distances 
between dSPNs (left) and iSPNs (right) (computed as in f) before and 
after 6-OHDA lesion, for bouts of different movement types. The mean 
pairwise distances between iSPNs, but not dSPNs, that activated on bouts 
of different movement types decreased significantly after 6-OHDA lesion, 
indicating that the iSPN cell ensembles active on the different movement 
types were less spatially distinguishable after 6-OHDA lesion. P= 0.02; 
Kolmogorov-Smirnoyv test. Plots in d-g are based on n = 102 and n= 126 
comparisons between bouts of the same movement type, and 255 and 315 
comparisons of bouts of different movement types, in 17 Drdla® and 21 
Adora2a“ mice, respectively, before 6-OHDA lesion, and on n=72 and 
n=78 comparisons between bouts of the same movement type, and 180 
and 195 comparisons of bouts of different movement types in 12 Drdla‘” 
and 13 Adora2a™ mice, respectively, >14 days after 6-OHDA lesion. The 
pre-lesion data are the same as in Extended Data Fig. 3. The data >14 days 
after the lesion are aggregated from the 1-h recordings on day 14 plus the 
30-min recordings on day 20 that occurred after saline vehicle injection 
but before administration of 6 mgkg~' t-DOPA (see Fig. 1a). 
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Extended Data Fig. 9 | After unilateral lesion of SNc dopamine cells, 
L-DOPA and dopamine agonists induce a contralateral turning bias, 
and Lt-DOPA can also induce dyskinesia. a—c, Time traces showing 
mean +s.e.m. effects of different doses of L-DOPA, SKF81297 and 
quinpirole on the contralateral rotational bias of mice, scored in 

20-min time bins, >14 days after unilateral 6-OHDA lesion. All drug 
doses had their maximal behavioural effects approximately 30 min after 
drug administration. All dose-time interactions differed significantly 
from vehicle treatment (P <5 x 1073), and the effects of L-DOPA and 
SKF81297 were dose-dependent (P< 5 x 10-3). Two-way, repeated- 
measures ANOVA. d, Time traces of the mean + s.e.m. mouse rotational 
bias for the three different drugs in a-c, at doses of each drug that induce 
comparable levels of rotational bias during the time window used in our 
studies for Ca” imaging (shaded in grey), during which we assessed the 
effects of these drugs on dSPN and iSPN activity in parkinsonian mice 
(Fig. 4). e, Comparison of the net rotational bias in the first 40 min after 
administration of the three drugs at the same doses as in d. Note that, as 
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assessed by their effects on turning behaviour, these three compounds 
were Statistically indistinguishable at these doses. ‘ns’ denotes P > 0.2 for 
comparisons between drugs; n = 13 mice; Wilcoxon signed-rank test. 
However, all drugs had significant effects relative to vehicle treatment. 
**D <5 x 10-3; Wilcoxon signed-rank test; n = 13 mice. f-i, Time traces 
showing how different dosages of L-DOPA affect the mean + s.e.m. axial 
(f), orofacial (g), limb (h) and total (i) AIMS values, a behavioural measure 
of dyskinesia in preclinical models of Parkinson's disease*!. All three 
L-DOPA doses had significant effects compared to vehicle treatments 
(P< 0.02; two-way ANOVA) and differed significantly in their effects at 
the different dosages (P < 0.05). As in d, the grey shading in i denotes the 


time window used for Ca? 


imaging (Figs. 4, 5). j, L-DOPA increased the 


peak total AIMS values, in a dose-dependent manner, in the first 40-min 
after mice received the L-DOPA doses in f-i. *P < 0.05, **P<5 x 107*; 
Wilcoxon signed-rank test. Error bars in a-d and f-i indicate s.e.m. All 
data in a-j are from a distinct cohort of n= 13 freely moving mice not 


subject to the protocol of Fig. la. 
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Extended Data Fig. 10 | SPNs encode specific movements via the 
activation of spatial clusters of cells consisting of both dSPNs and 
iSPNs. a, Classical models of the basal ganglia, such as the rate model’, 
posit opposing roles for dSPNs and iSPNs in movement generation. 

A simple instantiation of these ideas (a ‘stop-go’ model'!) predicts 
increased dSPN and decreased iSPN activity at movement initiation, and 
decreased dSPN and increased iSPN activity at movement termination. 
This is depicted here schematically in the traces of aggregate SPN Ca?* 
activity in the model (middle), relative to motion onset and offset (top). 
However, in the actual data the activity of both SPN types increases at 
motion onset and decreases at motion offset (bottom). b, Other types of 
models are more compatible with the concurrent activation of both SPN 
types at motion onset. In ‘suppression-selection’ models'*, dSPNs select 
a motor program, while iSPNs simultaneously suppress competing motor 
programs. In these models, specific subsets of dSPNs and iSPNs coactivate 
to select and suppress specific motor programs, respectively. Here we 
depict this idea using schematic traces of single-cell activity. dSPNs that 
encode either ‘type A’ or ‘type B’ movements activate selectively during 
one or the other type of movement (see top trace of movements in a). 

By contrast, isPNs, which suppress competing movements such as those 
of ‘type C’ and ‘type D;, activate on both ‘type A and ‘type B’ movement 
bouts. A prediction of this model is that the cell populations encoding 
different types of movement are more similar (that is, less selective), and 
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possibly more broadly distributed anatomically, for iSPNs than dSPNs. 
However, the actual data reported here show that the selectivity and spatial 
distributions of neural responses during movement are comparable in 
both SPN types (Fig. 2c-e and Extended Data Figs. 3d-f, 4j, k). c, More 
specifically, we found action-selective ensembles in both dSPNs and 
iSPNs (Extended Data Fig. 3). Unlike the suppression-selection model 

in b, our findings suggest that dSPNs and iSPNs coordinate to select 
motor programs, depicted here as traces of single-cell activity in which 
movement ‘type A and ‘type B’ encoding dSPNs and iSPNs activate 
selectively on bouts of movement types A and B, respectively (see top 
trace of movements in a). Furthermore, the dSPNs or iSPNs activated for 
one movement type were physically closer to each other than to SPNs 
activated for different movement types (Extended Data Fig. 3f). These 
analyses quantify what is visually apparent in the Ca”* videos of striatal 
activity, which show that multiple, segregated clusters of coactive dSPNs 
or iSPNs activate at the onset of motion (Supplementary Videos 1, 2). 
Finally, using two-photon microscopy to image the simultaneous patterns 
of Ca** activity in dSPN and iSPN ensembles, we found that the coactivity 
of nearby dSPN-iSPN pairs was indistinguishable from that of pairs of 
the same SPN type (Extended Data Fig. 4). Combining these findings, our 
data show that different subsets of spatially intermingled dSPNs and iSPNs 
activate for distinct types of movement (for example, ‘type A and ‘type B’ 
movements, bottom depictions of ‘spatial distributior). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


a Normal b Parkinsonian c Dyskinetic 
dSPNs iSPNs dSPNs iSPNs dSPNs iSPNs 
ee ee ee ee 
Spontaneous ee ||P *% ee ee e * 
resid e e ee e e@ ee 
neural activity 
cai | es . ee ce ec ee ° @ Active dSPN 
ee ry eee8 @ Active iSPN 
: e e e e 
Motion-related ee ee ee ee @ e ee ee 
neural activity ase ose a. e %e eee oo 
@ e e 
C) ee ee ee 
d Spatial e f 
distribution of 
neural actvty 
Motion- sf 
neural say | 
Spontaneous: A Striatum 
neural ee 
eT Lv aati Md - 
|| Indirect (D2) 
aie fs pe a” 
4 Excitatory 
Y y GPi/SNr 


Motor ession ——y 
suppression 


Motion he teased ---- ie the - ----- ---- 
Motor program 
activation 


Balanced activation 
& supression 
Specified motion 


Extended Data Fig. 11 | Implications for basal ganglia dysfunction 

in parkinsonian and dyskinetic states. a~c, Schematic depictions of 
the spatial organization of SPN activity. Active cells are shown as filled 
circles. a, Prior to dopamine loss both dSPNs and iSPNs exhibit spatially 
clustered patterns of coactivation when mice are resting. Clusters of 

the two cell types are spatially overlapping (Extended Data Figs. 4, 10). 
During movement, both cell types undergo an increase in the number 
of active cells and extent to which activity is spatially clustered. b, After 
dopamine loss dSPN activity decreases in rate but remains spatially 
clustered in resting mice (Fig. 3e, j). Moreover, dSPNs still coactivate 

in a spatially clustered manner during movement initiation (Fig. 3g). 
The rate of iSPN activity is increased after dopamine depletion in 
resting mice (Fig. 3e). However, iSPNs are less responsive at the onset 
of and throughout movement, and their motion-related activity shows 
almost no spatial clustering (Fig. 3g-j). c, During t-DOPA-induced 
dyskinesia (LID), dSPNs are hyperactive, unresponsive to movement, 
and spatiotemporally decorrelated (Fig. 5c-h), much like iSPNs after 
dopamine loss. During LID, iSPNs are hypoactive but retain their 
movement-related activation and spatial clustering (Fig. 5c—h), much 
like dSPNs after dopamine depletion. d-f, Iconic depictions of the SPN 
activity patterns during normal, parkinsonian and dyskinetic states, and 
the implications for downstream basal ganglia function. Icons are inspired 
by those in previous work. For each icon in the top row, the height of 
the rectangular pedestal represents the amplitude of spontaneous SPN 
activity in resting mice. The height and width of protrusions above the 
pedestals respectively denote the amplitude and spatial distribution 

of motion-related SPN activity. On the basis of the known excitatory 
and inhibitory projections between basal ganglia nuclei, the icons in 
subsequent rows show the predicted consequences for the downstream 
neural targets. In these icons, indentations into the rectangular pedestal 
denote inhibitions of neural activity during active movement. The globus 
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pallidus pars externa (GPe) and the sub-thalamic nucleus (STN) are 

in the basal ganglia’s indirect pathway and receive signals from striatal 
iSPNs. In the direct pathway, dSPNs project directly to the globus pallidus 
pars interna and substantia nigra pars reticulata (GPi/SNr), where 

signals from the two basal ganglia pathways converge (purple icons). 
GPi/SNr activity influences motor program selection by modulating 
downstream thalamacortical neurons and brainstem motor nuclei (not 
depicted). d, Before dopamine depletion, GPi/SNr cells receive signals 
during movement from the direct and indirect pathways that should 
induce, respectively, spatially structured patterns of neural activation and 
inhibition in GPi/SNr. We hypothesize that the convergence and spatial 
patterning of these bidirectional modulations enhance the specificity of 
the disinhibitory signals transmitted from GPi/SNR (left purple icon). In 
this view, the spatially coordinated, joint modulation of activity in 
GPi/SNr by both dSPNs and iSPNs may be crucial for well-choreographed 
motor programs'>. e, After dopamine depletion, spontaneous activity is 
reduced in dSPNs and increased in iSPNs. The rate model predicts that 
these changes in spontaneous activity lead to increases in spontaneous 
GPi/SNr activity (middle purple icon), tonically suppressing movement. 
Dopamine depletion also abolishes the spatially clustered, motion-related 
activation of iSPNs. This added deficit in motion-related iSPN activity is 
unaccounted for by the rate model and may upset the normal bidirectional 
spatial patterning of GPi/SNR output activity. This in turn may reduce 

the specificity of GPi/SNr inhibition, causing abnormal movement 
specification and motor coordination. f, During LID, spontaneous activity 
rises in dSPNs and declines in iSPNs. The rate model predicts that these 
changes should tonically drive movement by suppressing GPi/SNr activity 
(right purple icon). LID also disrupts the amplitude and spatial structure 
of motion-related dSPN activity. This decorrelated dSPN hyperactivity 
may disrupt the normally focused suppression of GPi/SNr activity during 
movement, resulting in increased but less specified movements. 
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A midline thalamic circuit determines 
reactions to visual threat 


Lindsey D. Salay!, Nao Ishiko! & Andrew D. Huberman!*3;4# 


How our internal state is merged with our visual perception of an impending threat to drive an adaptive behavioural 
response is not known. Mice respond to visual threats by either freezing or seeking shelter. Here we show that nuclei of the 
ventral midline thalamus (vMT), the xiphoid nucleus (Xi) and nucleus reuniens (Re), represent crucial hubs in the network 
controlling behavioural responses to visual threats. The Xi projects to the basolateral amygdala to promote saliency- 
reducing responses to threats, such as freezing, whereas the Re projects to the medial prefrontal cortex (Re—-mPFC) to 
promote saliency-enhancing, even confrontational responses to threats, such as tail rattling. Activation of the Re—-mPFC 
pathway also increases autonomic arousal in a manner that is rewarding. The vMT is therefore important for biasing how 
internal states are translated into opposing categories of behavioural responses to perceived threats. These findings may 
have implications for understanding disorders of arousal and adaptive decision-making, such as phobias, post-traumatic 


stress and addictions. 


Vision is a rich source of information about the presence of impending 
threats, such as predators or heights. To ensure survival, visual threats 
must be perceived and integrated with the neural circuits that control 
internal states (for example, arousal level) in order to engage the appro- 
priate behavioural responses. However, where in the brain this inte- 
gration occurs is unknown. Adaptive behavioural responses to visual 
threats fall into two general categories: those that reduce saliency to 
predators (to avoid detection), such as freezing or hiding’, and those 
that enhance saliency (to avoid capture), such as fleeing or retaliatory 
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Visual threat activates midline thalamic nuclei 

Fear, which manifests as a sudden shift towards freezing or hiding, 
can be evoked in mice by exposing them to a dark, rapidly expanding, 
overhead ‘looming’ stimulus that resembles an approaching predator! 
(Fig. 1a, Supplementary Videos 1 and 2). Fear does not occur in the 
absence of looming, or if looms are presented in the lower visual field! 
(Fig. 1b). In response to overhead looming, mice (n= 15, controls) 
spent 91.9% of the loom presentation time either freezing, or hiding in 
the provided shelter. For the remaining fraction of time (8.1%), mice 
were either ambulatory (walking; 6.4%) or running (1.7%), the vast 


Fig. 1 | Visual threat activates the ventral 
midline thalamus. a, Ethogram of responses 
to a looming threat. Black circles represent 
the looming stimulus (15 expansions in 24s). 
b, Behaviours in response to looming from 
above, no loom and looming from below. 

c, d, Looming from above, but not below, 
induces vMT c-Fos activation (c) and Xi 
activation (d) relative to no loom. e, Location 
of the ventral midline thalamus within the 
mouse brain. f-h, c-Fost (white) neurons in the 
vMT after overhead looming (f), no looming 
stimulus (g) or looming from below (h). Scale 
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bars, 100 jum. CM, central medial nucleus; 
IAM, intrantereomedial nucleus; Re, nucleus of 
reuniens; Xi, xiphoid nucleus. Data are mean + 
s.e.m. *P< 0.05, **P< 0.01, ***P< 0.001; NS, 
not significant. See Supplementary Table 1 for 
further details of the statistical analyses. 
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Fig. 2 | VMT neurons modulate visual threat responses. a—c, Diagrams 
depicting different stages of the experimental protocol. i.p., intraperitoneal. 
d, hM4Di-mCherryt cells (red) in the vMT. e, A zoomed-in view of 

the region outlined by a yellow square in d. Scale bars (d, e), 100 jum. 

f-h, Ethograms of responses to overhead looming in control mice treated 
with CNO (f), mice with vMT inactivation (g) and mice with VMT 
activation (h). i-m, Effects of vVMT activation or inactivation on the 
behavioural responses of mice. vMT inactivation eliminated tail rattling 


majority of which was in the direction of the shelter in order to hide. 
Control mice spent a very small portion (2%) of the total overhead 
loom presentation time ‘tail rattling, a behaviour typically associated 
with aggression® ®. Notably, all of the tail rattling occurred when the 
mice were hiding under the shelter, not in the open, unsheltered por- 
tion of the arena (Fig. 1a). 

To parse the neural circuits that determine behavioural responses 
to threatening visual stimuli, we examined c-Fos, a marker of recently 
active neurons, in the brains of mice recently exposed to overhead 
looming. Neurons in the vMT were highly activated by looms pre- 
sented overhead, but not by looms presented below (Fig. 1c-h), with 
the greatest activation occurring in the Xi (Fig. 1d-h). Because the VMT 
resides outside the primary visual pathway, it may represent an impor- 
tant station for modulating and/or promoting behavioural responses 
to visual threats. 


vMT neurons modulate threat responses 

To further assess the functional role of the vMT, mice were stereotaxi- 
cally injected with an adeno-associated virus (AAV) expressing hM4Di 
(Gijo-coupled human muscarinic M, designer receptor exclusively acti- 
vated by a designer drug (DREADD)) tagged with mCherry (AAV- 
hSyn-hM4Di-mCherry) (n=9 mice; Fig. 2a—-e). Control mice (n=9) 
received injections of a virus lacking DREADD (for example, AAV- 
hSyn-mCherry). Other mice (n= 15) received injections of AAV-hSyn- 
hM3Dq-mCherry (G,-coupled human muscarinic Mj; DREADD) into 
the vMT. hM4Di and hM3Dq receptors bind clozapine N-oxide (CNO) 
and cause robust decreases or increases, respectively, in the firing of 
central-nervous-system neurons™’° (Fig. 2b). vMT neurons expressing 
hM4Di and exposed to CNO showed a significant decrease in c-Fos 
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(i-]); VMT activation increased tail rattling (i, duration; j, incidence; 

k, percentage of mice) and running (i, duration; j, incidence) in the open 
arena (1, m). vMT activation increased ambulatory time in the open (i) 
and total motile behaviours (n) relative to control mice. o-s, Cumulative 
frequency distribution plots of behavioural responses. Data are mean 
+s.e.m. *P< 0.05, **P< 0.01, ***P < 0.001; NS, not significant. See 
Supplementary Table 1 for further details of the statistical analyses. 


activation, whereas a significant increase in c-Fos activation occurred 
in VMT neurons expressing hM3Dq (Extended Data Fig. 1). 

Marked differences in responses to looming were observed between 
the three groups of mice. Control mice displayed normal ratios of 
behaviours, with freezing and hiding accounting for the vast majority 
of the loom-stimulus presentation time (Fig. 2f, i, o-1, Extended Data 
Fig. 2). Mice in which hM4Di-mCherry was expressed in the VMT 
and CNO had been administered (that is, mice in which the VMT was 
inactivated), showed normal responses to overhead looming stimuli by 
freezing or running to hide (Fig. 2g, i, p-r); however, the overall num- 
ber of mice exhibiting tail rattling was reduced from 3 out of 9 control 
mice to 0 out of 9 vMT-inactivated mice (Fig. 2f, k, Supplementary 
Video 3). 

Considerable shifts in the behavioural responses to overhead looms 
were observed in mice that expressed hM3Dq and were treated with 
CNO (that is, mice in which the vMT was activated). Although the 
mice hid at normal frequency (Fig. 2h, i, r), they exhibited significant 
increases in the number (Fig. 2h, j, o) and duration (Fig. 2i) of tail- 
rattling episodes, most of which occurred outside the confines of 
the shelter (Fig. 2h, 1, Supplementary Video 4). The overall percen- 
tage of mice exhibiting tail rattling was also significantly increased 
(Fig. 2k). Tail rattling is associated with aggression in male mice before 
fighting”"! and in female mice eliciting maternal territorial behaviour’, 
although it can reflect non-aggressive states as well!!!. We observed 
significant increases in tail rattling in both male and female mice, with 
no sex differences (Extended Data Fig. 2). Although the emotional state 
associated with tail rattling is unclear, it unequivocally increases the 
visual and auditory saliency of the mouse to predators (Supplementary 
Video 4). Mice treated with both hM3Dq and CNO also displayed 
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Fig. 3 | Divergent vMT outputs influence threat responses. a—d, Mice 
were injected with AAV-GFP in the vMT (a, c; n= 17 mice) and axons 
(green) were observed in the BLA (b) and mPFC (d). e, Locations in the 
brain at which retrograde tracing was performed. f-h, Retrograde labelled 
vMT-—+BLA (red, f), VMT—mPFC projection neurons (green, g), and 

the merged image (h); n=5 mice. i, j, Injections (i) and representative 
image of hM3Dq/Cre* neurons (red) in the vMT, projecting to the BLA 
(j). k, Ethogram of the responses to looming in mice when the VMT—BLA 
neurons are activated. 1, m, Injections (1) and representative image of 
hM3D/Cre* neurons (red) in the vMT that project to the mPFC (m). 


significantly more walking (duration) and running (incidence), with 
the vast majority of this locomotion occurring within the open arena 
(Fig. 2 i, j, m, p). Overall, we observed a significant shift towards more 
motile behaviours in mice treated with both hM3Dq and CNO com- 
pared to control mice (Fig. 2n, s), and a shift away from immotile, 
saliency-reducing behaviours (for example, freezing and hiding). 
No increase in the amount of motile behaviour occurred in the absence 
of visual threats (Extended Data Fig. 6). Therefore, vMT influences 
action selection in response to threats, but does not influence locomo- 
tor behaviour in itself. 


Distinct threat responses by vMT outputs 

Next we explored the structure and function of vMT output circuitry. 
We injected AAV-XFP into the vMT (n= 12 mice; Fig. 3a, c, Extended 
Data Fig. 3a, b) and two weeks later, collected and analysed the brains 
(Fig. 3a-d). The two major outputs of the vMT were the basolateral 
amygdala (BLA) (Fig. 3a, b) and the medial prefrontal cortex (mPFC) 
(Fig. 3c, d). Two-colour retrograde tracing (Fig. 3e) from these targets 
revealed that they originate from non-overlapping, spatially compart- 
mentalized sets of VMT neurons. The VMT—BLA pathway originated 
from neurons in the Xi nucleus (Fig. 3f, h), whereas the vMT—-mPFC 
pathway originated from the Re nucleus (Fig. 3g, h). 

The rodent BLA and mPFC have been shown to be involved in a 
range of functions including the control of fear'*!° and anxiety-related 
behaviours'® |’. Given the divergent outputs of different VMT neurons 
to each of these targets (Fig. 3e—h), we investigated how each path- 
way affects the responses to looming stimuli. To study the vVMT—-BLA 
pathway, we injected the BLA with a canine adenovirus-2 expressing 
Cre-recombinase (CAV-Cre), which infects synaptic terminals and thus 
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n, Ethogram of the responses to looming in mice with VYAT—mPFC 
neurons activated. o, p, Schematic of injections (0) and ethogram of 
responses to looming in mice with vVMT axon terminals activated in 

the mPFC (p). q, r. Activating the VMT—mPEC pathway, but not the 
vMT-—BLA pathway, increases tail rattling (q, duration; r, incidence) 
relative to controls. BLA, basolateral amygdala; mPFC, medial prefrontal 
cortex; PL, prelimbic cortex; ACC, anterior cingulate cortex. Scale bars, 
100 jum. Data are mean + s.e.m. *P< 0.05, **P < 0.01; NS, not significant. 
See Supplementary Table 1 for further details of the statistical analyses. 


‘retrogradely’ infects neurons with Cre’’. We then injected AAV-hSyn- 
DIO-hM3Dq-mCherry (Fig. 3i) into the vMT of the same mice, thereby 
inducing expression of hM3Dq-mCherry selectively in Cre-expressing 
vMT-—BLA neurons (n =8 mice; Fig. 3j); we then treated the mice with 
CNO to activate vVMT—BLA circuitry. This caused a marked increase 
in the duration of freezing in response to looms, and a reduction in the 
amount of time that the mice spent hiding (Fig. 3k, q). No increases 
in tail rattling were observed after activation of VMT—BLA neurons 
(Fig. 3k, q, r). 

In other mice (n = 8) we injected the mPFC, with CAV-Cre, followed 
by an injection of AAV-hSyn-DIO-hM3Dq-mCherry into the vVMT 
(Fig. 31); this resulted in specific infection of the hM3Dq-mCherry/ 
Cre* neurons that project to mPFC (VMT—mPFC; Fig. 3m, Extended 
Data Fig. 4). We then injected these mice with CNO to selectively 
activate VMT—mPFC circuitry. This caused a significant increase in 
tail-rattling events compared to controls, but did not change freezing 
or hiding behaviours (Fig. 3n, q, r). 

In both the VMT—mPFC.-activated and the VMT—mBLA-activated 
mice, we observed mCherry* axons in the nucleus accumbens (NAc), 
suggesting that axonal projections from the vMT bifurcate to innervate 
the NAc. Anterograde tracing from the vMT with AAV-GFP confirmed 
this (Extended Data Fig. 3a—-d). We tested whether the VMT—NAc 
pathway altered behavioural responses to looming (Extended Data 
Fig. 3e, f), but found that it did not (Extended Data Fig. 3g). Therefore, 
the VMT—mPFC and the VMT—BLA circuits determine the behav- 
ioural responses to visually threatening stimuli. 

To specifically activate the VMT—mPFC pathway and not the 
bifurcated NAc axons, we injected AAV-ChR2-eYFP (ChR2, channel- 
rhodopsin-2; eYFP, enhanced yellow fluorescent protein)”° into the 
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Fig. 4 | Saliency-enhancing behaviours to threat persist after VMT 
stimulation. a, Representative image of ChR2-eYFP* cells in the vMT. 
Scale bars, 100 xm. b-d, Ethograms of responses to looming in control 
mice (b), mice with vMT activated in concert with the loom (c, co- 
activated) and mice with vMT activated before the loom (d, pre-activated). 
e-g, Co-activating and pre-activating the vMT increased tail rattling and 
running (e, duration; f, incidence). Co-activating the vMT decreased 


vMT and implanted a fibre-optic cannula above the mPFC (Fig. 30). 
Upon optogenetic stimulation of the VMT—mPFEC pathway, 100% of 
mice exhibited tail rattling (Fig. 3p, Extended Data Fig. 5). The stimu- 
lation of VMT—mPFC terminals significantly increased tail rattling 
(duration and incidence) and running (incidence) (Fig. 3p-r). 
Collectively, these data reveal that each of the two divergent outputs 
of the vMT determines a unique set of responses to visually threaten- 
ing stimuli: the VMT—mPFC circuit increases tail rattling and motile 
behaviours, and the VMT—BLA circuit suppresses these behaviours 
and encourages freezing instead, effects that do not simply reflect dif- 
ferences in the number of infected neurons in each condition (Extended 
Data Fig. 4a—d). 


Impact of vMT activation prior to loom exposure 

To understand further how the vMT affects behavioural response 
selection to visual threats, we again used virally-expressed ChR2 
(n = 32 mice; Fig. 4a) and varied the timing of vMT activation rel- 
ative to loom presentations. In the first condition, the VMT was 
stimulated at the same time as the presentation of the loom stimulus 
(Supplementary Video 5). Control mice with AAV-GFP expressed 
in the vMT, but no ChR2, showed normal loom responses (n = 17 
mice; Fig. 4b, Extended Data Fig. 2). By contrast, mice with ChR2 
expressed and blue light stimulation presented in concert with the 
looming stimulus exhibited significant increases in both the duration 
and incidence of tail rattling and running (Fig. 4c, e, f). Overall, vVMT 
stimulation together with looming significantly increased saliency- 
enhancing behaviours, and decreased saliency-reducing behaviours 
such as freezing (Fig. 4e, g). 

Next we asked whether vMT stimulation before looming also induces 
shifts in the behavioural responses of mice. We stimulated the VMT 
for 30s before any exposure to looming stimuli (Fig. 4d) (n= 15). This 
induced a remarkably similar pattern of effects to VMT stimulation 
in concert with looms; in both cases, the mice displayed significantly 
increased saliency-enhancing behaviours, such as tail rattling and 
running, compared to controls (Fig. 4e-g). This suggests that vMT 
stimulation causes a shift in the internal state of the mouse and does 
not somehow alter visual perception in itself. 
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freezing (e) and increased total motile behaviours (g). h, Schematic of the 
overhead-sweep-stimulus experiment. i-j, Ethograms of responses to the 
overhead sweep stimulus in control mice (i) and mice in which the VMT 
is activated during the sweep (j). k, Co-activating the vMT decreased 
freezing in response to the sweep. Data are mean + s.e.m. *P < 0.05, 
**P< 0.01, ***P < 0.001. See Supplementary Table 1 for further details of 
the statistical analyses. 


To further test the idea that vMT activation shifts mice towards states 
that favour saliency-enhancing responses to threat, we exposed a subset 
of mice to a visual ‘sweep’ stimulus, which reliably induces freezing 
but not escape and/or hiding behaviours”! (Fig. 4h, i, k). Activating 
the vMT with ChR2 and blue light during the presentation of visual 
sweeps significantly reduced the amount of sweep-induced freezing and 
promoted saliency-enhancing behaviours (Fig. 4j, k). ChR2-induced 
activation of the vMT also significantly reduced freezing and avoidance 
of olfactory threats”””3, and increased saliency-enhancing ambulatory 
and approach behaviours (Extended Data Fig. 6a-c). 


vMT activation increases arousal level 

The fact that vMT activation causes robust shifts in the responses 
to fear-inducing stimuli even when it is induced before the stimuli 
are delivered suggests that vMT activation modulates arousal 
state. To explore this, we examined the size of the pupil in response 
to illumination at constant light levels”*, which is a fast read-out of 
arousal*>®, 

Heightened arousal increases pupil diameter”°. We explored how 
vMT stimulation affects pupil size at constant 100-lx ambient light. 
vMT stimulation resulted in a significant increase in pupil size (60% 
change) compared with that of control mice (Fig. 5a—c). Even after 
vMT stimulation was ceased, the pupil remained significantly dilated 
(80%; Fig. 5c). To separate the arousal-driven from the light-driven 
component of the pupil reflex, we tested the consensual’ reflex of the 
pupil in one eye, in response to light pulses delivered to the opposite 
eye in otherwise dark-adapted mice. Mice in which vMT activity was 
enhanced showed normal pupil constriction in response to light, but in 
the dark showed significantly increased pupil dilation compared to con- 
trols (Fig. 5d, e). Increased dilation was also observed in mice in which 
the VMT—mPFC pathway was stimulated, but not in mice in which 
the VMT—BLA pathway was stimulated (Fig. 5f). Sham stimulation 
and CNO alone did not affect pupil size (Extended Data Fig. 7). vMT 
activation also caused a significant increase in heart rate (Fig. 5g, h) 
and a trend towards increased breathing rate (Fig. 5i). Thus, 
pan-vMT and vVMT— mPFC activation does not affect light-driven 
pupil responses; rather, it increases arousal. 
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Fig. 5 | vMT activation increases arousal. a, Schematic of experimental 
set-up to analyse arousal-driven pupillary responses to vMT stimulation. 
b, c, Pupil size in constant light (100 lx) conditions before, during and after 
vMT stimulation (n = 12 mice) relative to controls (n= 8 mice, dashed 
line in c). d, Schematic of experimental set-up to analyse light-driven 
pupillary responses in dark-adapted mice. e, Relative pupil size before, 
during and after a 30-s light pulse (1000 Ix) in mice with and without vVMT 
stimulation (n= 11 mice). f, Under constant light conditions, activating 
the VMT—mPFC pathway, but not the vVUT—BLA pathway increased 
relative pupil size. gi, Heart rate (g, h) and breathing rate (i) in mice with 
and without vMT stimulation (n= 8 mice). j, Illustration of the RTPP 

test. k, Representative tracing of a mouse with vMT stimulation in the 
RTPP test. 1, Mice with vMT activation (n= 14) spent more time in the 
stimulated side as compared to control mice (n= 17). Data are mean + 
s.e.m. *P< 0.05, **P<0.01, ***P < 0.001. See Supplementary Table 1 for 
further details of the statistical analyses. 


Changes in arousal can be associated with positive or negative states. 
We questioned whether vMT stimulation is rewarding, aversive or neu- 
tral. ChR2-expressing mice (n = 14) were placed in a two-chamber 
arena and the vMT was stimulated using a fibre-optic cannula every 
time the mouse entered one particular side of the chamber (Fig. 5j). 
The stimulation was ceased when the mice explored the opposite side. 
This real-time place preference (RTPP) test revealed a strong prefe- 
rence for the vMT-stimulated side (Fig. 5k, 1). This did not reflect an 
alteration in locomotor activity (Extended Data Fig. 6). vMT activation 
also did not disrupt the perception of danger cues in a visual cliff test, 
aggression or tail rattling in response to a male intruder, or overall 
levels of anxiety in an open-field test (Extended Data Fig. 6). Therefore, 
increases in arousal induced by vMT stimulation are rewarding; that is, 
they have positive valence. 


vMT firing scales with perceived threat 

Next we carried out single-unit tetrode recordings from freely behaving 
mice before, during and after exposure to looming stimuli (Fig. 6a; 
see Methods). Example spiking activity from five different vVMT cells in 
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response to the first round of looming stimuli is shown in Fig. 6b. Four 
out of five cells showed increased firing to looms relative to the pre- 
loom condition. Overall, the firing rates of VMT neurons increased by 
approximately 30% in response to looms on the first day of loom expo- 
sure (Fig. 6c), which is when the most robust behavioural responses 
to loom occur. 

Aligning vMT-spiking activity with the behaviours exhibited during 
looming showed that vMT spiking scaled with the degree to which 
the mice exhibited different behavioural responses. The firing rates of 
vMT neurons markedly increased when the mouse tail-rattled, walked 
or ran, but not when the mouse froze (Fig. 6d). Overall, there was a 
significant increase in vMT firing when the mouse exhibited salien- 
cy-enhancing behaviours as opposed to saliency-reducing behaviours 
to looming (Fig. 6d-f). 

It is well established that mice habituate their responses when loom- 
ing is repeatedly presented over several days’. We explored whether 
vMT firing adjusts to those changes, and found that the relative pre- 
loom versus loom firing rate of VMT neurons attenuated in parallel 
with the behavioural attenuation (Fig. 6g), an effect also seen with 
c-Fos (Extended Data Fig. 8). These data indicate that the firing of 
vMT neurons scales according to the novelty of a threat stimulus, and 
in a way predicts the changes in behavioural responses to visual threats 
over time. 


Discussion 

We show that when vMT neurons are activated before, or during, sim- 
ulated predator looms or olfactory threat, mice shift from saliency- 
reducing to saliency-enhancing reactions to those threats. Whereas 
looms typically induce freezing, activation of the vMT or the 
vMT—mPFC pathway induced tail rattling—a behaviour associ- 
ated with aggression'!-!%, Although the range of emotional states 
represented by behaviours such as tail rattling is likely to be broad, 
saliency-enhancing behaviours in general can be thought of as 
‘courageous’ because they increase the visibility of the mouse in the 
direct presence of a perceived threat. Indeed, from the perspective 
of predator-prey interactions, avoiding predation reflects a trade- 
off between avoiding detection (for example, freezing, hiding or 
camouflage) and avoiding capture (for example, running or fighting); 
which category of behaviour is adaptive requires ongoing risk 
assessment. Our data indicate that arousal level appears to bias which 
category of behavioural response mice will exhibit in response to 
threats (Extended Data Fig. 9). Activation of the vMT consistently 
increased arousal and saliency-enhancing ‘courageous’ behaviours 
to threats. This is unlikely to indicate increased stress because our 
real-time place preference test showed that the internal state caused 
by vMT activation is rewarding (that is, it has positive valence). In 
light of this, it is notable that at least two previous studies found that 
self-stimulation of the human brain area that is homologous”® to the 
rodent vMT is strongly reinforcing, even more so than activation of 
areas associated with sexual arousal”*?”. 

The vMT structures described here reside well outside the primary 
visual pathway and receive diverse sources of input from sensory struc- 
tures, including the superior colliculus*!, as well as from hindbrain 
areas associated with arousal and emotion, such as the raphe and the 
periaqueductal grey*”*? (Extended Data Fig. 10). Thus, the vMT may 
act as a hub for integrating several types of sensory inputs with inter- 
nal states in order to select specific categories of behavioural outputs. 
In contrast to the vast inputs to the vMT, the outputs of the VMT are 
fairly restricted: they include the basolateral amygdala, the medial 
prefrontal cortex (Fig. 3) and the nucleus accumbens (Extended Data 
Fig. 3), structures all shown previously to be associated with fear and/or 
reinforcement*16?4, 

Our data point to the powerful influence that activation of thalamic 
midline structures can have on arousal and behavioural responses to 
threatening stimuli. Other midline-residing structures—such as the 
ventromedial hypothalamus and the periaqueductal grey—have been 
shown by others to control aggression and arousal***, which raises 
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Fig. 6 | vVMT neurons are activated during saliency-enhancing 
behaviours. a, Tetrode recordings in the vMT were performed during 
presentation of the looming stimulus. b, Sample raster plots of five isolated 
units before, during and after the looming stimulus. c-e, Mean relative 
firing rates on the first day of loom presentation (c, 23 cells from 4 mice), 
during different behaviour epochs (d, 87 total cells from 4 mice) and 
during motile and immotile behaviours (e, 67 cells from 4 mice), relative 


ideas about whether midline nuclei of the hind-, mid- and forebrain 
may represent a larger circuit devoted to sensory-autonomic modula- 
tion. In the future, experiments that adjust the activity of several brain 
areas such as the periaqueductal grey, the ventromedial hypothalamus 
and the vMT could shed light on this idea. The strong convergence of 
circuitries involved in arousal, reward and action selection to perceived 
threats in structures such as the vMT also raises the possibility that 
dysregulation of midline thalamic function may underlie disorders of 
these processes such as addiction, phobias and post-traumatic stress 
disorder. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0078-2. 
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METHODS 


Mice. All experimental protocols were conducted in accordance with the National 
Institutes of Health (NIH) guidelines for animal research and were approved by 
the Stanford University and University of California, San Diego Animal Care and 
Use Committee (IACUC). Mice were housed under a 12-h light:dark cycle with 
food and water ad libitum. Male and female C57BL/6 mice were 8-12 weeks old 
at the time of viral injection. All subjects were randomly assigned to different 
experimental conditions used in this study. Group sample sizes were chosen on 
the basis of previous studies. 

Viruses. AAV-hSyn-hM3D(Gq)-mCherry, AAV-hSyn-hM4D(Gi)-mCherry, AAV- 
hSyn-DIO-hM3D(Gq)-mCherry, AAV-hSyn-DIO-mCherry, and AAV-hSyn-eGFP 
vectors were packaged by the Vector Core at the University of North Carolina 
at Chapel Hill (UNC Vector Core). AAV-Efla-DIO-hChR2(H134R)-eYFP was 
provided by B. K. Lim (UCSD) and K. Deisseroth (Stanford) and was combined 
with AAV-iCre (Vector BioLabs) to induce opsin expression. AG-rabies-mCherry 
and AG-rabies-GFP were produced by the Salk Institute Gene Transfer, Targeting 
and Therapeutics Core (GT3). CAV2-Cre virus was obtained from The Institute 
of Molecular Genetics of Montpellier vector core (France). All viral vectors were 
stored in aliquots at —80°C until use. 

Stereotaxic surgery and drive implantation. Mice were anaesthetized with 
1.5-3.0% isoflurane and given the analgesic buprenorphine (subcutaneously, 
0.3mgkg~!). A midline scalp incision was performed to expose the skull. After 
performing a craniotomy, 0.5 1l of viral vectors (titer, 10!” particles per ml) were 
injected into the vVMT (bregma: —1.0mm, midline: 0.0mm, dorsal surface: 
—4.0 mm) using a Nanoject II (Drummond) injector. The virus was slowly infused, 
and 10 min after infusion the glass pipette (with a resistance of approximately 
0.5 MQ) was retracted. Half of the mice in each cage were randomly assigned to 
either control or treatment groups. All control mice were treated with the same 
experimental procedures, but AAV-XFP was injected instead. 

To determine the inputs to the vMT, 0.1-0.4 jl of AG-rabies-mCherry was 
injected into the vMT. Mice were then housed in a biosafety room for 4-6 days 
to allow the rabies virus to infect and express mCherry in presynaptic cells. To 
determine the outputs, AAV-GFP was injected into the vMT. The entire brain was 
sectioned and imaged on a Zeiss slide scanner. 

For activating vMT output neurons, CAV-Cre was injected bilaterally into the 
BLA (bregma: —1.3 mm, midline: 3.0 mm, dorsal surface: 4.75 mm), the mPFC 
(bregma: +-2.5 mm; midline: 0.25 mm; dorsal surface 1.5mm) or the NAc (bregma: 
+1.3 mm; midline: 1.0 mm; dorsal surface 4.5mm). In the same mice, AAV-DIO- 
hM3D(Gq) was injected into the vMT. Mice were tested after at least three weeks 
of recovery. 

To determine whether the vMT projection neurons to BLA and mPFC represent 
distinct populations, CT8—488 (green) was injected into the mPFC and CTB—594 
(red) was injected into the BLA of the same mouse. The vMT was examined for 
cells with both CT8—488 and CT8—594. To further determine whether vMT pro- 
jection neurons bifurcate to the mPFC and the BLA, mice injected with CAV-Cre 
to either target and a Cre-dependent AAV to the vMT were examined for axons 
expressing mCherry in the mPFC and the BLA. 

For optogenetic experiments, AAV-ChR2 or AAV-GFP was injected into the 
vMT. Then, a fibre optic cannula (200 jum diameter, 0.22 NA; Doric Lenses) was 
implanted above the vMT or the mPFC and was affixed to the skull with dental 
cement. Mice were given four weeks to recover for stimulating in the vMT and six 
weeks to recover for stimulating in the mPFC. 

For tetrode recording from the vMT, mice were implanted with a microdrive 

with two or four tetrodes (bregma: —1.0 mm, midline: 0.0 mm, dorsal surface: 
—3.5mm). Tetrodes composed from 17-\1m polyimide-coated platinum-iridium 
90-10% wire (California Fine Wire Company) were attached to the microdrive. 
The electrode tips were cut flat and plated with platinum to reduce impedances to 
100-300 kQ at 1 kHz. The microdrive was secured to the skull with dental cement 
and jeweller’s screws with one screw connected to the microdrive ground. Mice 
were given three days to recover from surgery before habituation to the headstage 
and multi-wire tether. 
Recording procedures and spike sorting. After surgery, the tetrodes were turned 
into the vMT in small increments to reach the appropriate depth at the end of each 
day. Mice were placed in the looming chamber to record pre-, during- and post- 
loom activity. Because the headstage partially blocked visual perception directly 
overhead, looming stimuli were presented at an angle. To ensure the same cell was 
not counted twice, unit activity was collected from sessions with at least 451m 
separation from previous loom sessions. The microdrive was connected to the 
recording equipment through an AC coupled unity-gain operation amplifier. Spike 
amplitudes above 40,:V (around two times the level of noise) were collected. 

Spikes were sorted manually with offline cluster-cutting software (TINT soft- 
ware, Axona) based on clustering and waveforms. Autocorrelation and cross- 
correlation functions were used to separate cells. Only cells with a peak firing rate 
greater than 1 Hz pre-, during- or post-looms were analysed. 


Light delivery. Mice with fibre optic implants were connected to an optical fibre 
(200 jum diameter, 0.22 NA; Doric Lenses) that delivers blue light from a 473 nm 
laser. Blue light was delivered in 5-ms pulses at 20 Hz for 30s before or during 
the entire duration of the looming stimuli. The light power at the fibre tip was 
about 10-20 mW, ensuring a light intensity of approximately 1 mW mm ~ over 
the entire vMT. 

Clozapine N-oxide delivery. For chemicogenetics, mice were administered 
clozapine N-oxide (CNO, Toric Bioscience; 1.5 mg kg! for mice with hM3Dq, 
5.0mgkg~! for mice with hM4Di) by an intraperitoneal injection 30 min before 
behavioural analyses. 

Behavioural analyses. Mice were handled every day for three days before behav- 
iour testing. No significant differences in behaviour responses were observed 
between males or females tested during night and day. All behaviour was per- 
formed during consistent hours in the afternoon or evening. When mice were 
tested in several behaviour assays, they were given at least 48h rest between tests. 
All behaviours were performed and scored blind to treatment group. 

Looming behaviour. Mice were placed in a 50cm x 25cm x 40cm glass chamber 
with a 24-inch LCD monitor stationed on top facing downwards to display the 
stimulus. The floor and three walls of the chamber were covered with a matte 
coating (Krylon) to prevent reflections of the looming stimulus. At one end of the 
box, there was a 12-cm-high black shelving board to provide a shelter under which 
mice could hide. A top video camera (Yi Action Camera) and a side video camera 
(Panasonic HC-W850 camcorder in nightshot mode, 30 fps) recorded the mouse’s 
movements during the session. Mice were given 10 min to habituate to the arena. 
Then, the looming stimulus (black disc on a white background, 15 expansions 
over 24s) was initiated when the mouse was in the centre of the arena. All mice 
were tested only once owing to the fact that they can habituate to the looming 
stimulus even after one trial of looming presentation. The locomotor behaviours 
of the mice were recorded and analysed automatically with a video tracking system 
(Biobserve software). 

The behaviour of the mice was characterized on the basis of the following crite- 

ria. Freezing was defined as episodes of three or more seconds in which the mouse 
was completely immobile except for movements from respiration. Running was 
defined as events in which the speed of the mouse was greater than two times the 
average speed before stimulus onset. Hiding was defined as episodes in which the 
mouse’s body was completely under the shelter provided. Tail rattling was defined 
as events in which the mouse rapidly moved its tail back and forth (1 bout of 
rattling = 1s of rattling). Ambulation was defined as all other motile behaviours 
performed in the open arena. 
Pupillary light reflex. To test the arousal-driven aspect of the pupillary reflex, mice 
were adapted to constant, ambient room light (1001x) for 1h. All mice were kept 
unanaesthetized and manually restrained during the experiment. The pupil was 
recorded under constant light conditions for 30s before stimulation (baseline), 30s 
during stimulation, and 30s after stimulation. Control mice were treated exactly 
the same except for ‘shan’ stimulation. 

To test light-driven pupillary reflexes, mice were dark-adapted in their home 
cages for 1h before the experiment. A single blue LED (470 nm, 10001x) was placed 
in front on the left eye. The right eye (consensual pupil reflex) was recorded before 
(baseline), during (constriction) and after (dilation) the 30s blue light stimulus. 
Mice were again dark-adapted for 1h after the first trial. Then, the same mice were 
tested for pupillary reflex with optogenetic stimulation, chemicogenetic stimu- 
lation or sham stimulation. The pupil size was later measured in ImageJ by an 
experimenter blind to treatment. 

Real-time place preference. Mice were placed into a two-chamber acrylic box 
(40cm x 20cm x 25cm, each chamber 20cm x 20cm). Each chamber has different 
contextual cues on the wall. Mice were monitored for 20 min. When all four paws 
were in the stimulation chamber, the stimulation was turned on and remained on 
until the mouse left the chamber. The mice were tracked with automated tracking 
software (Biobserve) and the time spent in each chamber was recorded. 

Home cage vMT activation. Mice were monitored in their home cages before and 
during vMT activation by chemicogenetic or optogenetic methods. For optoge- 
netic stimulation, a range of laser intensities (2 mW, 5mW, 10 mW and 20 mW) 
and a range of stimulation frequencies (5 Hz, 10 Hz and 20 Hz) were tested to 
determine an optimal stimulation protocol and monitor behaviour. Behaviour was 
scored and locomotion behaviour was analysed using automated tracking software 
(Biobserve). No changes in typical behaviours were observed during stimulation 
in the home cage (data not shown). 

Predator odour avoidance. Mice were individually housed and dark-adapted 
before testing. All behaviour testing was performed under dim red light at night. 
Medical gauze with trimethylthiazoline (50% TMT, 5,11), or no odour (clean 
gauze) was placed in the cage. Behaviour was monitored for 10 min during odour 
exposure. 

Heart rate and breathing rate measurements. Heart rate and breathing rate were 
recorded at 1 Hz in freely moving mice using a pulse oximeter (MouseOx Plus; 
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Starr Life Sciences). Mice were shaved around the neck to enable the red light to 
pass through to the detector. Recordings were obtained for 10 min in the home 
cage with intermittent 1-min periods of blue light stimulation. Data were collected 
using the MouseOx Plus Conscious Applications Software. 

Resident-intruder test. Behaviour assays were performed under dim red light in 
the dark cycle. Male mice were housed with a female for >7 days. The female was 
removed 1h before testing. Solitary mice were tested twice for aggression with WT 
129/Sv socially housed stimulus mice of similar body weight for 10 min. Aggressive 
behaviours (attack, tail-rattle) and non-aggressive behaviours (sniffing, grooming 
intruder) were scored. 

Open-field test. Mice were placed in the outer zone of the open-field chamber 
(50cm x 50cm) for 18 min. The 473 nm laser was intermittently turned on and 
off with 3-min periods. Total time and distance in the centre (36cm x 36 cm) was 
analysed using automated tracking software (Biobserve). 

Visual cliff. The visual cliff behaviour was analysed in an open-top Plexiglass 
chamber. Half of the box was protruded from the counter to provide a 3-foot 
depth. The same checkerboard pattern was displayed on the countertop and the 
floor below. A 5-cm high platform was stationed in the middle of the box equally 
intersecting both the shallow side and the deep side. The mouse was placed on the 
platform and was allowed to choose between the two sides to step down on. If the 
mouse stepped down on the shallow side with all four paws, the behaviour choice 
was designated as shallow. All other behaviour choices (0-3 paws on the shallow 
side) were designated as deep. Each mouse performed the task five times per day 
to give a total score of percentage choosing the shallow side. 

Sweep behaviour. Mice were placed in the same chamber as that used for the 
looming behaviour with a 24-inch LCD monitor stationed on top facing downward 
to display the sweep stimulus. Mice were given 10 min to habituate to the arena. 
Then, the sweep stimulus (2.5-cm diameter black disc on a white background, 
sweeps across the screen in 4s) was initiated when the mouse was in the centre of 
the arena. All mice were tested only once owing to the fact that they can habituate 
to the sweep stimulus even after one trial. The locomotor behaviours of the mice 
were recorded and analysed automatically with a video tracking system (Biobserve 
software). 

Immunohistochemistry. After transcardinal perfusion with saline and then 4% 
paraformaldehyde, brains were collected and postfixed for 24h. The brains were 
then transferred to 30% sucrose for cryoprotection and sectioned coronally at 45 jum. 
If needed, brains were kept at 4°C overnight with the following antibodies: 
rabbit-anti-c-Fos to label active neurons or rabbit-anti-DsRed to enhance the 
mCherry signal. For secondary detection, Alexa Fluor 488 donkey anti-rabbit 
or Alexa Fluor 594 donkey anti-rabbit were used. Immunostained tissues were 
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imaged with a Zeiss LSM 880 Airyscan confocal microscope or a Zeiss AxioScan 
microscope (Zeiss, Germany). 

c-Fos induction. After behaviour assays, the brains were collected for a standard 
c-Fos induction protocol. For visual threat experiments, mice were exposed to a 
visual stimulus for 5 min: looming above, below or no looming stimulus. 

To analyse the efficiency of viral infection and cell activation or inactivation, a 
similar c-Fos induction protocol was used. For optogenetic experiments, a train 
of 5-ms light pulses at 20 Hz over 3 min was delivered. For chemicogenetics, mice 
were given CNO to induce the appropriate change in neural activity. 
Quantification of cell number. For mice with CAV-Cre injected to the BLA or 
mPFC and AAV-DIO-hM3D-mCherry into the vMT, we compared the number 
of mCherry* cells in the vMT. For c-Fos screens, we compared the number of 
c-Fos* cells in the vMT and Xi in control mice and mice that experienced visual 
threats. After we immunostained the brains with identical protocols, the brains 
were imaged on an automated slide scanner (Zeiss). A brain map was overlain on 
the digital image to identify the appropriate regions using landmark structures, 
white matter tracts, ventricles and optic tract as reference guides. An experimenter 
blind to condition analysed the images using the automated ImageJ cell counter. 
Statistics and reproducibility. All statistical analyses were performed in R v.3.3.2. 
(R Development Core Team 2017, https://www.r-project.org/) or GraphPad Prism. 
The sample sizes were chosen based on common practice in animal behaviour 
experiments. Data were first tested for normality. If the null hypothesis that the 
data come from a normal distribution cannot be rejected, parametric tests were 
used (for example, Student's t-test, one-way ANOVA with Tukey’s post-hoc test). 
Otherwise, non-parametric tests were used. Paired tests were used to compare 
within-group repeated-measures data. Owing to the strong non-normality of the 
looming behaviour data, generalized linear models (GLMs) with non-normal dis- 
tributions or non-parametric tests were used for the looming behaviour analysis. 
All run count data were analysed using a GLM with a Poisson distribution. All 
tail-rattle count data were analysed using a GLM with a quasi-Poisson distribution 
because the data were over-dispersed. Freeze, hide and ambulation data did not 
fit well to regression models, so non-parametric Kruskal-Wallis tests with Dunn's 
correction were performed. Percentages of mice and binary data were analysed 
with a Fisher's exact test. All statistical tests were two-tailed. Significance levels 
are indicated as follows: *P< 0.05; **P<0.01; ***P < 0.001. For representative 
images, similar results were obtained in at least three independent trials. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. The data that support the finding of this study are available from 
the corresponding authors upon reasonable request. 
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Extended Data Fig. 1 | hM3Dq activates vMT, whereas hM4Di 
inactivates vMT. Refers to Figs. 1 and 2. a, Timeline of the c-Fos induction 
protocol. b-d, Recently active c-Fos* neurons (green) in the vMT of 

mice that were exposed to the looming stimulus, with XFP (b), hM4Di 

(c), or hM3Dq (d) injections (red) into the VMT and CNO delivered 
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running (b, f), freezing (c, g), or hiding (d, h). Controls include: mice 
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but without CNO; and mice with AAV-XFP, optrode implant and sham 
stimulation. i-], Comparison of male and female control mice (i, k) and 
mice with vMT activation (j, 1) reveals no significant sex differences 
across any of the behaviours performed in response to the looming threat. 
Notably, both male and female mice tail-rattle in response to the looming 
threat. Data are mean + s.e.m.; NS, not significant. See Supplementary 
Table 1 for statistical analysis and sample sizes. 
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that project to the NAc, CAV-Cre was injected into the NAc and Cre- 
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Extended Data Fig. 4 | Viral targeting and number of cells infected 

in the vMT. Refers to Figs. 3 and 4. a, To activate VMT neurons that 
project to the mPFC or the BLA, CAV-Cre was injected into the mPFC 

or the BLA and Cre-dependent hM3D was injected into the vMT. b, The 
average number of infected hM3d-mCherry* vMT cells did not differ 
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with tail-rattling behaviour. f, g, Representative images of hM3dq- 
mCherry/Cre* neurons (red) in the vMT that project to the mPFC. 

h, Mice were injected with AAV-ChR2 in the vMT to activate the vMT. 

i, j, Representative images of ChR2-eYFP* neurons (green) in the VMT 
and the fibre tracts composed of vMT axons that project to the mPFC. 
Data are mean + s.e.m.; NS, not significant. See Supplementary Table 1 for 
statistical analysis and sample sizes. 
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Extended Data Fig. 5 | vMT activation results in saliency-enhancing 
behaviours performed in the open arena. Refers to Figs. 3 and 4. 

a, Percentage of mice tail-rattling in response to looming after VMT—PFC 
or VMT-—BLA activation. All of the mice with YMT—mPFC optogenetic 
stimulation displayed tail-rattling behaviour. b, c, Percentage of tail- 
rattling (b) or running (c) events performed in the open arena as opposed 
to in the shelter. Mice with VMT— mPFC optogenetic stimulation 
perform most tail-rattling and running events in the open. d, Optogenetic 


activation of the vMT results in most mice tail-rattling. e, Mice with 
vMT optogenetic stimulation perform most tail-rattling events in the 
open arena as opposed to in the shelter. f, Mice with vMT optogenetic 
stimulation perform most running events within the open arena as 
opposed to towards the shelter. Data are mean + s.e.m. *P< 0.05, 

**P< 0.01, ***P < 0.001; NS, not significant. See Supplementary Table 1 
for statistical analysis and sample sizes. 
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Extended Data Fig. 6 | vMT activation does not change locomotion, 
aggression or anxiety, but does result in less freezing in response to a 
predator odour. Refers to Figs. 4 and 5. a, Mice were injected with AAV- 
ChR2 in the vMT in order to activate the vMT during presentation of 

the predator-odour threat. b, Activating the vMT (n=9 mice) decreased 
freezing as compared to controls with sham stimulation (n =7 mice). 

c, Activating the vMT decreased avoidance as compared to controls 

with sham stimulation (avoidance index = (P — 50)/50, where P is the 
percentage of time the mouse spent on the side of the arena away from 
the odour). d, The visual cliff test. e, Activating the vVMT (hM3D + CNO, 
n=15 mice) or inactivating the VMT (hM4D + CNO, n = 9) during the 
visual cliff test did not change the number of times in which the mouse 
chose the shallow side as compared to control mice (n = 9 mice, P< 0.05). 
f, The RTPP test. g, Activating the VMT (ChR2, n=7 mice) did not change 
the relative activity (distance covered) in the RTPP test as compared to 
control mice (n =6 mice). h, Mice were tested on the resident-intruder 
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test for aggressive behaviours. i, Activating the VMT (ChR2, n=7 mice) 
did not change the average number of tail-rattling events in the resident- 
intruder test as compared to control mice (n = 14 mice). j, k, Activating 
the vMT (ChR2, n=7 mice) did not change the percentage of mice 
attacking (j) or the latency to attack (k) in the resident-intruder test as 
compared to control mice (n= 14 mice). 1, Mice were subjected to an open 
field test to analyse anxiety-related behaviour; representative tracing of a 
control mouse (AAV-GEP, left) and a mouse with vMT activation (AAV- 
ChR2, right) in the open field test. m, Activating the VMT (ChR2, n= 10 
mice) did not change the percentage of time in the centre of the open field 
as compared to control mice (n= 10 mice). n, Activating the vMT (ChR2, 
n= 10 mice) did not change the relative activity (distance covered) in the 
open field test as compared to control mice (n = 10 mice). Data are mean 
+ s.e.m. *P < 0.05; NS, not significant. See Supplementary Table 1 for 
statistical analysis and sample sizes. 
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Extended Data Fig. 7 | vMT activation results in increased arousal. 
Refers to Fig. 5. a, Schematic of the experimental set-up to analyse light- 
driven pupillary responses in mice. Treating control mice with CNO did 
not change pupil size across all light levels. b, Chronic vMT activation by 
chemicogenetic methods (+CNO) significantly increases pupil size across 
all light levels as compared to the same mice without activation (—CNO). 
c, Chronic vMT inactivation (+CNO) did not the change pupil size across 
all light levels as compared to the same mice without activation (—CNO). 
d, Schematic of experimental set-up to analyse arousal-driven pupillary 
responses in mice with vMT activation, mice with vMT inactivation and 
control mice. The pupil was measured in constant light (100 lx) conditions 
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in mice with and without CNO. e, After CNO delivery, mice with hM3Dq 
had a significant increase in relative pupil size. Mice with XFP and hM4Di 
did not have a significant change in pupil size after administration of CNO. 
f, In constant dark conditions, vMT activation significantly increased pupil 
size, whereas vMT inactivation did not change pupil size as compared 

to control mice with CNO. g, In constant dark conditions, optogentic 
activation of the vMT significantly increased pupil size (n = 11 mice) 
compared to control mice (n = 12 mice). Data are mean + s.e.m. *P < 0.05; 
**P< 0.01; ***P < 0.001; NS, not significant. See Supplementary Table 1 
for statistical analysis and sample sizes. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


a . 
Naive group: Dark Visual c-Fos 
adapt stimuli expression 
No prior exposure 
to loom | =e 
Habituated group: 
Repeated Dark Visual c-Fos 
loom exposure adapt stimuli expression 
4 days Sap 5S ae 


Extended Data Fig. 8 | Looming stimuli induce vMT activation in 
naive, but not experienced mice habituated to looms. Refers to Fig. 6. 
a, Schematic of the experimental protocol. b, c, Quantification of c-Fos* 
cells in the vMT of naive or pre-exposed mice that experienced looming 
from above revealed a significant decrease in c-Fos" cells in the VMT 
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(b) and Xi (c) after looming from above in pre-exposed mice that are 
habituated to the looms (mn =6 mice) as compared to naive mice (n=7 
mice). Data are mean + s.e.m. **P < 0.01; ***P < 0.001. Supplementary 
Table 1 for statistical analysis and sample sizes. 
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Extended Data Fig. 9 | Tail rattling and running scales with arousal. 
Refers to Figs. 4 and 5. a, b, Mice in which ChR2 had been injected into 
the vMT were divided into three groups based on the extent to which VMT 
activation increased arousal responses: low, moderate or high (n=7,n=9 
and n= 10 mice, respectively). Mice with high arousal had significantly 
more tail-rattling (a) and running (b) events in response to looming, as 
compared to mice with low arousal (P< 0.001 and P< 0.001). ¢, d, Mice 
with high arousal spent similar amounts of time freezing (c) and hiding (d) 


in response to looming, as compared to mice with low arousal. e, f, 100% 
of mice with high arousal tail-rattled (e) and ran (f) in response to 
looming, whereas only 33% and 20% of mice with low arousal tail-rattled 
and ran, respectively. g, h, Similar numbers of mice with high arousal 
froze (g) and hid (h) in response to looming as compared to mice with 
low arousal. Data are mean + s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001; 
NS, not significant. See Supplementary Table 1 for statistical analysis and 
sample sizes. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


a : b Relative density of projection neurons to the vMT 
AG-rabies-XFP 

(Retrograde tracer) infralimbic cortex +++ ventromedial hypothalamus ++ 
prelimbic cortex +++ subiculum ++ 

anterior cingulate cortex ++ parabrachial nucleus + 

superior colliculus ++ insular cortex + 

periaquiductal grey ++ ectorhinal cortex + 

dorsal raphe nucleus ++ medial mammillary nucleus + 

2 vMT median raphe nucleus ++ medial preoptic nucleus + 
suprachiasmatic nucleus ++ ventral lateral genicuate nucleus + 


Labeling: light+ moderate++ dense+++ 


Retrograde labeling of projection neurons to the vMT 


Superior Colliculus (SC) —> vMT Dorsal Raphe (DRN) —>vMT Median Raphe (MRN) —> vMT 


, and median raphe (e). DRN, 


Extended Data Fig. 10 | Hindbrain inputs to the vMT. a, Schematic dorsal raphe (d), periaqueductal grey (d) 


illustration of AG-rabies-XFP injection into the vVMT to map vMT inputs. dorsal raphe; MRN, median raphe; PAG, periaqueductal grey; PRN, 
b, Quantification of the relative density of projection neurons to the VMT pontine reticular nucleus; SCm, superior colliculus, motor; SCs, superior 
(n= 12 mice). c-e, Representative images showing the expression of AG- colliculus, sensory. Scale bars, 100 jum. 


rabies-XFP in transynaptically labelled cells in the superior colliculus (c), 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


https://doi.org/10.1038/s41586-018-0062-x 


Cryo-EM structure of substrate-bound 
human telomerase holoenzyme 
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The enzyme telomerase adds telomeric repeats to chromosome ends to balance the loss of telomeres during genome 
replication. Telomerase regulation has been implicated in cancer, other human diseases, and ageing, but progress towards 
clinical manipulation of telomerase has been hampered by the lack of structural data. Here we present the cryo-electron 
microscopy structure of the substrate- bound human telomerase holoenzyme at subnanometre resolution, showing two 
flexibly RNA -tethered lobes: the catalytic core with telomerase reverse transcriptase (TERT) and conserved motifs of 
telomerase RNA (hTR), and an H/ACA ribonucleoprotein (RNP). In the catalytic core, RNA encircles TERT, adopting a 
well-ordered tertiary structure with surprisingly limited protein-RNA interactions. The H/ ACA RNP lobe comprises 
two sets of heterotetrameric H/ ACA proteins and one Cajal body protein, TCAB1, representing a pioneering structure of 
a large eukaryotic family of ribosome and spliceosome biogenesis factors. Our findings provide a structural framework 
for understanding human telomerase disease mutations and represent an important step towards telomerase-related 


clinical therapeutics. 


Linear eukaryotic chromosomes are capped by telomeric repeat 
tracts that recruit the telomere-binding proteins that are essential to 
distinguish telomeres from DNA breaks and thus to avoid telomere 
end-resection and interchromosomal fusions!”. Owing to inherently 
incomplete genome replication, telomeres are progressively shortened 
in each cell cycle*. Critically short telomeres result in genome instability 
and cell death**°. To compensate for this sequence loss, a specialized 
reverse transcriptase, telomerase, adds telomeric repeats to the chro- 
mosome 3’ end, using TERT and an integral telomerase RNA subunit 
(TER) with an internal template for repeat synthesis®. Human telomer- 
ase activation in embryogenesis and its repression in somatic tissues 
govern cellular renewal capacity, with telomerase deficiency imposing 
haematopoietic and epithelial failures and aberrant telomerase activa- 
tion enabling tumorigenesis’. 

Despite its medical relevance, efforts to design drugs that target 
telomerase have been hampered by limited structural information. 
Endogenous human telomerase holoenzyme is scarce. Most studies of 
human telomerase use cellular overexpression of limiting TER (hTR 
in humans) and TERT®. Two hTR domains are critical for activity®””: 
a domain with the template and adjacent pseudoknot (t/PK) and the 
conserved regions 4 and 5 (CR4/5), consisting of a branched junction 
of stems P5 and P6, and the activity-critical stem-loop P6.1 (Fig. 1a). 
Together with TERT, these domains are sufficient to reconstitute telo- 
merase activity with the repeat addition processivity characteristic of 
the cellular holoenzyme!'. TERT is composed of four domains: the 
N-terminal (TEN) domain, the high-affinity RNA-binding domain 
(TRBD), the reverse transcriptase domain (RT), and a C-terminal 
extension (CTE) analogous to a polymerase thumb" (Fig. 1a). The 
high-resolution structure of a streamlined TERT from the flour bee- 
tle Tribolium castaneum, which lacks the TEN domain, established 
that the latter three domains form a ring, referred to as the TERT 
ring”. The TERT ring alone can bind a template-primer duplex’, 
and, accordingly, the human TERT ring has been shown to support 
single-repeat synthesis’. Purified or overexpressed hTR and TERT 


form heterogeneous complexes, only some of which are catalytically 
active'°. It remains unclear whether the active enzyme requires an 
hTR and TERT dimer’®”” or only a monomer of each subunit’>"®. 
The landmark 9 A cryo-electron microscopy (cryo-EM) structure of 
the ciliate Tetrahymena thermophila telomerase holoenzyme revealed 
its overall architecture, including a monomeric TERT-TER catalytic 
core!*. However, the subunit composition of the Tetrahymena telomer- 
ase holoenzyme is distinct from its human counterpart, and available 
structural information on human telomerase holoenzyme is limited to 
a 30A negative-stain electron microscopy reconstruction’. Therefore, 
both the structure and the composition of the human telomerase holo- 
enzyme remain poorly characterized. 

Telomerase assembly initiates in human cells with nascent hTR 
binding to H/ACA proteins and their chaperones”’. Human H/ACA 
proteins are shared by telomerase and the small nucleolar (sno) and 
small Cajal body (sca) RNP families, which catalyse ribosomal and 
spliceosomal RNA modifications, respectively’. Eukaryotic H/ACA 
snoRNPs contain the pseudouridine synthase dyskerin (Cbf5), NOP 10, 
NHP2, and GARI, assembled on each of two hairpin stems; and 
scaRNPs also contain TCAB1/WDR79 bound to loop CAB-box motifs. 
The hTR H/ACA 3’ hairpin loop has a CAB box but the 5’ hairpin 
instead extends into CR4/5 (Fig. 1a). All eukaryotic H/ACA RNPs, 
including telomerase, require both hairpins of the H/ACA motif 
for RNP biogenesis in cells’, unlike their single-hairpin archaeal 
counterparts”!, but it remains unclear whether the active human 
telomerase holoenzyme has one or two sets of H/ACA proteins'®, 
only dyskerin”’, or only NOP10 and dyskerin’”. Syndromes of human 
telomerase deficiency have been traced to mutations in dyskerin, 
NOP10, NHP2 and their chaperone NAFI1°. While the structure of 
the related single-hairpin archaeal RNP is known”‘, there is no 
high-resolution structure of any complete eukaryotic H/ACA RNP. 

Here, to address the structure and composition of human telomer- 
ase holoenzyme, we optimized active telomerase purification and used 
cryo-EM to determine its structure at subnanometre resolution, which 
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Fig. 1 | Telomerase holoenzyme reconstitution and characterization. 

a, Domain architecture of TERT (top) and secondary structure of 

hTR (bottom). Paired stems (P) are shown compacted in length. b, Silver- 
stained SDS-PAGE gel showing fractionation of purified, heterogeneously 
active TERT RNP. Proteins were detected by mass spectrometry and 
immunoblotting (Extended Data Fig. 1a, Extended Data Table 1). ¢, In 
vitro telomerase assays performed on the three elution fractions shown 

in b, the beads after elution, and fivefold diluted El. A 12-nucleotide 


allowed us to fit available high-resolution structures and homology 
models of all components. Our structure reveals in detail the organ- 
ization of ten protein subunits and hTR, illuminating the structural 
basis of hTR motif functions and human disease mutations that cause 
telomerase deficiency. 


Activities of distinct structural states 

We reconstituted human telomerase holoenzyme by transient transfec- 
tion of HEK 293T cells with vectors expressing TERT and hTR®. After 
affinity purification steps to enrich for hTR and TERT, the predomi- 
nant additional proteins in the sample were TCAB1 and the H/ACA 
proteins (dyskerin, NOP10, NHP2 and GAR1), each of which was 
assigned by immunoblotting and, with the exception of glycine/ 
arginine-rich GARI, also confirmed by mass spectrometry (Fig. 1b, 
Extended Data Fig. la, Extended Data Table 1). Furthermore, pro- 
tein identities were confirmed by subunit drop-out purifications (see 
below). Direct primer-extension assays of eluted fractions showed 
that the amount of TERT did not fully correlate with the amount 
of telomerase activity; also, dilution of the less-active fraction E1 
increased rather than decreased activity (Fig. 1c). Visualization by 
negative-stain electron microscopy of El and E2, which we refer to as 
inactive and active fractions, respectively, revealed morphologically 
distinct populations of particles. Particles from the inactive fraction 
were more elongated (typically 400 A or more in length) and more 
heterogeneous than those in the active fraction (Fig. 1d, e). Particles 
from the active fraction consisted of two asymmetric lobes, nearly 
300 A in length (Fig. 1e). Previous studies have shown that cellular 
human telomerase reconstitution yields complexes both monomeric 
and dimeric for TERT!™"”, explaining the observed particle hetero- 
geneity. For cryo-EM studies, we modified our purification protocol 
to recover only active fractions (Extended Data Fig. 1b, c), added 
a DNA substrate (T;4AG3) during purification, and confirmed the 
presence of the bound substrate in the purified RNP (Extended Data 
Fig. 1c). We also demonstrated that our tagged TERT was active 
in vivo (Extended Data Fig. 2a-e). 


Architecture and flexibility of telomerase 

We prepared ATCAB1 telomerase with an hTR CAB box mutant 
that disrupted TCAB1 binding*>”* and separately prepared TERT- 
hTRmin?””8, in which hTRmin contains only the activity-essential 
t/PK and CR4/5 domains of hTR. By comparing their negative-stain 
2D class averages to those of the intact holoenzyme, we established 
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(nt) oligonucleotide was used as recovery control (RC). Processive 

repeat additions are numbered or indicated by dots. d, e, Representative 
negative-stain 2D class averages for fractions E1 and E2, respectively. We 
also observed a small fraction of E2-like particles in El (data not shown), 
while E2 contained predominantly the particles shown in e. Experiments 
in b and c were performed three times and twice, respectively, with similar 
results. For gel source data, see Supplementary Fig. 1. 


the particle regions corresponding to TCAB1, the H/ACA RNA motif 
and proteins, and TERT with the two activity-essential domains of 
hTR (Extended Data Fig. 1d-h). This revealed that one lobe of the 
holoenzyme contained TERT and its associated hTR motifs (the cat- 
alytic core) and the other contained the H/ACA RNP components 
and TCAB1 (the H/ACA lobe). We next collected cryo-EM data to 
determine the structure of the substrate-bound human telomerase 
holoenzyme (Extended Data Fig. 3a, b, Extended Data Table 2a). 
Initial image processing revealed extensive conformational flexibility 
between the two lobes, as well as within each lobe, which limited the 
quality of the overall reconstruction (Extended Data Figs. 3c, 4a). We 
performed focused classification with signal subtraction”? to obtain 
more homogeneous subsets of particles for each lobe separately. This 
procedure yielded reconstructions at 7.7 A and 8.2 A resolution for 
the catalytic core and H/ACA lobes, respectively (Extended Data 
Figs. 3c, 4b-e). At these resolutions, densities for protein a-helices, 
8-barrels, and double-stranded RNA are discernible, allowing us to 
unambiguously fit available protein and RNA structures or homology 
models, and ultimately to account for the majority of the density, 
except for flexible regions of single-stranded RNA and protein exten- 
sions (Fig. 2a, b, Extended Data Fig. 4f). We also obtained a recon- 
struction of the entire holoenzyme at 10.2 A resolution (Extended 
Data Fig. 3c). 
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Fig. 2 | Cryo-EM structure of the substrate-bound human telomerase 
holoenzyme. a, Schematic of subunit arrangements. b, Front (left) and 
back (right) views of the cryo-EM reconstructions for the H/ACA lobe at 
8.2 A and the catalytic core at 7.7 A, with fitted subunits colour-coded as 
indicated. 
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Fig. 3 | hTR structure in human telomerase holoenzyme. a, Schematic of 
secondary structure of hTR, with domains arranged based on the cryo-EM 
reconstruction. Regions modelled are highlighted with soft-coloured 
backgrounds. b, Front (left) and back (right) views of the structure of hTR 
highlighted within the human telomerase holoenzyme structure, with 
domains colour-coded as in a and unmodelled connections shown as 
dashes lines. 


Subunit distribution across the two lobes 

The bilobal human telomerase includes one TERT molecule and one 
hTR (Fig. 2a, b), in agreement with suggestions of an evolutionarily 
conserved single-TERT catalytic core for telomerases from ciliate and 
yeast model systems*”! to human!>-!®3?, The complex also contains 
one TCAB1 and two sets of the four mature H/ACA RNP proteins. The 
previous human telomerase negative-stain electron microscopy recon- 
struction also had a bilobal architecture, interpreted as dimeric TERT, 
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although gold-labelling experiments indicated the presence of a mix- 
ture of RNPs containing both one (50%) and two (36.5%) TERT mole- 
cules!” Incomplete separation of active from inactive TERT assemblies 
may account for the mixture of complexes in the previous study. Our 
finding of separate lobes for the catalytic core and the H/ACA RNP 
gives a structural explanation for why telomerase holoenzyme RNP 
biogenesis strategies have such plasticity across eukaryotes!’. 

The extended architecture of hTR scaffolds the overall subunit organ- 
ization across both lobes (Fig. 3a, b). The catalytic core and H/ACA 
lobes are bridged by stems Pla and P4.2 (Fig. 2a, b, 3b). The crystal 
structure of Tribolium TERT in complex with an RNA-DNA duplex 
could be readily fitted within the catalytic core density (Fig. 4a, Extended 
Data Figs. 5c, d, 6b). As the Oryzias latipes (medaka) TRBD struc- 
ture** includes vertebrate TERT motifs not present in the streamlined 
Tribolium TERT (Extended Data Fig. 7e), we removed extensions from 
the medaka TRBD structure that did not fit our density and used this 
model to replace the Tribolium TRBD (Extended Data Fig. 5b). Adjacent 
to the TERT ring, there is globular density with approximately the 
dimensions of the Tetrahymena TEN domain crystal structure™* (Fig. 4b, 
Extended Data Fig. 5a). The Tetrahymena and human TEN domains 
share only 20% sequence identity and have differences in predicted sec- 
ondary structure*; therefore, we removed regions of the former that are 
likely to be less structurally congruent with the human structure for ten- 
tative fitting into our map (Extended Data Fig. 6a). The NMR structure 
for the full human PK” and the crystal structure of the medaka CR4/5 
in the conformation bound to TRBD* were fitted in one side of the 
catalytic core (Figs. 2b, 3a, b, 4a, Extended Data Fig. 5h). The PK is con- 
nected to the template region located in the TERT active site by a bent 
helical RNA density that we attributed to the sequential junction and 
stem elements of P2 (P2b-J2a/b-P2a-J2a/a.1-P2a.1)'°3”"8 (Figs. 3a, 4a). 


Fig. 4 | Structure of the catalytic core. a, Front 
(left) and back (right) views of the catalytic 
core. Domains of human TERT and hTR are 
colour-coded as in Fig. 1a. hTR is in semi- 
transparent space-filling representation, and 
base triples within the PK are indicated in blue. 
b, Route of single-stranded RNA connecting 
hTR P2a.1 and the template, which threads near 
the TEN domain. c, TERT interactions with the 
substrate-template duplex. Active site residues 
are shown as blue spheres. The human TERT 
IFD insertion missing in the Tribolium TERT 

is indicated between helices «10 and a1 of 
the Tribolium TERT! (Extended Data Fig. 6b). 
A dotted line depicts the linker connection of 
TEN and TRBD domains. d, The human TRBD 
and CTE domains are encircled by hTR P2 
(yellow) and PK (orange) and CR4/5 (wheat) 
domains. e, f, Architecture of Tetrahymena 
TERT domains with TER motif colours as 
indicated’®: PK, orange; template recognition 
element (TRE), yellow; SL2, dark grey; SL4, 
wheat. Base triples within the PK are indicated 
in blue. For clarity, domains of TER coloured 
light grey in f are not shown in the structure in 
e. g, The Tetrahymena TRBD and CTE domains 
are encircled by t/PK (orange), TRE (yellow), 
SL2 (grey) and part of SL4 (wheat). Base triples 


; within the PK are indicated in blue. 
Base triples 
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Fig. 5 | The H/ACA RNP and disease mutations. a, Front (left) and back 
(right) views of the H/ACA lobe within the human telomerase structure, 
with colour-coded subunits: TCAB1, yellow; dyskerin, blue; GARI, red; 
NOP10, orange; NHP2, magenta; hTR, black. To distinguish between the 
two sets of H/ACA proteins, the first set bound to the 5’ hairpin (P4 stem) 
is in a lighter shade. b, Schematics of subunit arrangements within the 


We docked two copies of the crystal structure of an archaeal H/ACA 
RNP“4, each consisting of Cbf5 (orthologous to dyskerin), GAR1, 
NOP10 and L7Ae (structurally related to NHP2) and a single-hairpin 
RNA, into the H/ACA lobe (Figs. 2a-c, 5a, b, Extended Data Figs. 5e, f, 
7f, 8a-d). The single-hairpin RNA was truncated to a 9-bp stem 
flanked by the ACA motif to mimic P4 and P7 stems flanked by box 
H and ACA, respectively, in hTR. The hTR segments bound by each 
archaeal RNP equivalent can be unambiguously assigned on the basis 
of their connectivity with the rest of the RNA densities. The P4-P4.1 
stem is connected to the CR4/5 domain in the catalytic core via the 
P4.2 stem, and P7 is connected to the P8 stem-loop (Fig. 3a, b, 5a, b, 
Extended Data Fig. 5i). Density in the regions of P1, P4.1 and P4.2 
was fitted with A-form RNA duplexes. TCAB1 harbours a WD40 
domain, and its homology model fit uniquely at the top of the H/ACA 
lobe (Figs. 2b, 5a, b, Extended Data Figs. 5g, 7g), consistent with the 
missing density from the hTR CAB-box mutation (Extended Data 
Fig. 1h). This assignment of TCAB1 places it adjacent to an RNA- 
hairpin-like density that was fitted with the NMR structure of the 
P8 stem-loop*”. 


The catalytic core 

The t/PK and CR4/5 domains of hTR wrap around TERT to form 
the catalytic core (Fig. 4a). In turn, the human TERT ring encircles 
the fitted Tribolium TERT RNA-DNA bound to the TERT active site 
(Fig. 4a, c), which shifts the template position compared to that in 
the substrate-free Tetrahymena telomerase holoenzyme (Extended 
Data Fig. 7i). The majority of the TERT RNA binding surface is con- 
tributed by the TRBD and CTE, which are sandwiched between the 
PK and CR4/5 (Fig. 4d). The full human PK (P2b-P3) is connected 
to P2a by the J2a/b bulge, and P2a connects to the less conserved 
helical extension P2a.1 before the transition to the single-stranded 
3’-template-flanking region (Figs. 1a, 3a, 4a). The t/PK forms a rigid, 
arc-like structure (Figs. la, 4a), as predicted by modelling using NUR 
structures of its subdomains'*3**8, The less conserved end of the PK 
P3 stem approaches the TRBD close to its interface with the CTE and 
co-axially stacks with P2b (Fig. 4d). The unexpectedly limited interac- 
tion between the hTR PK and TERT favours a model for indirect PK 
function in the co-folding of TERT and hTR*™“°. The location of P2a.1 
positions the immediately 5’ single-stranded 3’-template-flanking RNA 
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H/ACA lobe. ¢, Interactions of TCAB1 with hTR and H/ACA proteins. 
The backbone amides of disease-associated mutations in dyskerin, NOP10 
and NHP2 that mapped near TCAB1 and P8 interaction surfaces are 
shown as spheres. d, Mapping of a cluster of disease mutations (backbone 
amides shown as spheres) in dyskerin near the dyskerin-dyskerin and 
dyskerin-RNA interfaces. 


such that it could thread between the TEN domain and the TERT ring 
(Fig. 4b, Extended Data Fig. 7c), similar to the path of the 3’-template- 
flanking TER in the Tetrahymena telomerase holoenzyme”? 
(Fig. 4e), despite major changes in the remainder of the t/PK second- 
ary and tertiary structure (Figs. la, 4f, Extended Data Fig. 7j). The 
proximity of the TEN domain to the template 3’ end further reinforces 
its proposed role in stabilization of RNA-DNA duplex at the template 
3’ end'*”’, The position of the Tetrahymena 5! template-boundary 
element, stem-loop 2 (SL2), relative to the TRBD, is roughly similar to 
that of human P1b (Fig. 4d, g), consistent with their common function 
in 5’ template boundary determination*’. 

The CR4/5 domain, which consists of P5, P6 and P6.1 stems con- 
nected as a three-way junction, adopts a similar conformation to that in 
the medaka CR4/5-TRBD crystal structure** (Fig. 4d, Extended Data 
Fig. 7d), except that human P6 is longer and composed of two stems, 
P6a and P6b. The Péa stem binds along TRBD (Fig. 4d), while the P6.1 
hairpin emanates from the three-way junction almost perpendicularly 
to Péa, inserting into the interface between the TRBD and CTE. Clear 
helical density is observed for P6b, which makes no RNA-RNA or 
RNA-protein interactions. P5 co-axially stacks with P6a, but like P6b, 
it does not interact with TERT (Fig. 4d, Extended Data Fig. 7d). The 
P6.1-TERT interaction is critical for TERT association with hTR and 
telomerase catalytic activity*”**, The P6.1 apical loop was disordered 
in the TRBD-CR4/5 crystal structure* and, although our resolution is 
insufficient for de novo building, its density is visible in our cryo-EM 
map, appearing packed against the CTE (Extended Data Fig. 7d). Our 
structure suggests that interaction with the CTE stabilizes this loop, 
consistent with previous crosslinking studies’. In the Tetrahymena 
telomerase cryo-EM structure, the stem-terminus element (SL4) is 
bound at the interface between the TRBD and CTE of TERT”, similar 
to P6.1 in our human telomerase structure (Fig. 4d, 4g). Whether or not 
they are related evolutionarily, P6.1 and SL4 serve a similar structural 
role in telomerase catalytic core architecture. Finally, the catalytic core 
has extra density that is likely to correspond to TERT and hTR regions 
that were not possible to model (Extended Data Fig. 4f), such as the 
‘insertion in fingers’ (IFD) subdomain of the RT domain (residues 
734-802) that is missing in Tribolium TERT (Extended Data Figs. 6b, 
7a, b), and the non-conserved flexible linker between the TEN domain 
and the TERT ring (Fig. 4c). 
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The H/ ACA RNP and human disease mutations 
Our structure offers the first glimpse of the subunit architecture of an 
intact eukaryotic H/ACA RNP, with two sets of H/ACA heterotetramer 
proteins that bind two RNA hairpins (Fig. 5a, b). As expected, the 3’ 
hairpin loop associates with the single holoenzyme TCABI subunit 
(Fig. 5a—c). We observe an interaction between the two H/ACA protein 
heterotetramers, predominantly mediated by the two dyskerin mole- 
cules (Fig. 5d, Extended Data Fig. 7h). The first set of H/ACA proteins 
contacts the 5’ hairpin P4 stem exclusively via dyskerin (Fig. 5a, b, 
Extended Data Fig. 5e). By contrast, the second set forms more extensive 
interactions with the 3’ hairpin P7 stem and P8 stem-loop. The P7 stem 
is bound by dyskerin, similarly to the 5’ hairpin P4, and continues to P8, 
which interacts with NOP10, NHP2 and TCABI (Fig. 5a, b, Extended 
Data Fig. 5f, i). The fact that binding of dyskerin at the base of the 5 
hairpin is sufficient to anchor the entire core heterotetramer accounts for 
the evolutionary divergence of the vertebrate TER 5’ H/ACA hairpin*® 
and for the hTR 5’ hairpin tolerance of changes in stem and pocket 
structure*®. Loss of hTR 5’ hairpin contacts to H/ACA proteins is 
compensated by an hTR-specific increase in protein binding to the 
3’ hairpin'®. The position of dyskerin defined by our structure explains 
why H/ACA RNP subunit exchange occurs in cell extracts for GARI, 
NOP10 and NHP2, but not dyskerin*’. The surprisingly limited protein— 
RNA interaction of the hTR 5’ hairpin implies that a wider breadth of 
RNA structures can assemble H/ACA proteins than has been predicted 
from the snoRNAs and scaRNAs, which conserve not only H/ACA pro- 
tein binding but also requirements for dyskerin catalytic function”'. 
The single subunit of TCAB1 in human telomerase contacts both the 
3’ hairpin CAB box and dyskerin in the second set of H/ACA proteins 
(Fig. 5c, Extended Data Fig. 7g), giving a unifying explanation for its 
simultaneous identification as an RNA-independent dyskerin bind- 
ing protein and a sequence-specific, direct RNA binding protein**”. 
Unexpectedly, TCAB1 also contacts GARI (Fig. 5c, Extended Data 
Fig. 7g). Protein—protein interactions by TCAB1 explain why such a 
limited amount of CAB-box sequence can stabilize its association with 
H/ACA RNPs. Structure fitting of the archaeal H/ACA RNP, combined 
with sequence homology between the archaeal and human proteins, 
allowed mapping of human dyskeratosis congenita and Hoyeraal- 
Hreidarsson disease mutations in our structure*’ (Extended Data 
Fig. 8a—d). Notably, a large number of dyskerin mutations cluster at 
the dyskerin-dyskerin interface, which is also close to its RNA binding 
surface (Fig. 5d, Extended Data Fig. 7f, h). In addition, dyskerin R158 
is near TCAB1, and NOP10 R34 and NHP2 V126 are near the P8 stem 
(Fig. 5c). Mutations at any of these newly evident interfaces would be 
expected to have more severe impact on hTR than snoRNAs or scaR- 
NAs for three reasons: (a) crippled protein-RNA interaction of the hTR 
5‘ hairpin makes 5’ hairpin assembly more dependent on cross-hair- 
pin dyskerin—dyskerin interactions; (b) hTR is uniquely reliant on 
enhanced H/ACA-protein affinity for the 3’-hairpin P8 stem-loop; 
and (c) hTR has one rather than two CAB box locations for TCAB1 
association'*“°. Our ability to map disease mutations to the H/ACA 
RNP structure expands the scope of telomerase deficiency mechanisms 
beyond decreased assembly of the core heterotrimer (dyskerin, NOP 10, 
NHP2)* to include decreased dyskerin interaction with itself and with 
TCAB1 as well as with hTR. 


Conclusions 

The structure presented here provides, to our knowledge, the first 
architectural characterization of a telomerase holoenzyme from a mul- 
ticellular species, addressing the longstanding question of vertebrate 
telomerase holoenzyme subunit content and arrangement. Our struc- 
ture opens new opportunities for the design and screening of inhibitors 
targeting hTR folding and TERT interaction that will hold promise as 
general anti-cancer therapeutic agents’. 
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METHODS 

Telomerase reconstitution. Human telomerase was reconstituted in cells with 
the TERT expression plasmid pcDNA3.1-ZZ-Tev-TwinStrep-TERT expressing 
ZZ-SS-TERT, which has a Tobacco Etch Virus protease cleavage site between the 
tandem Protein A and TwinStrep tag modules, and the hTR expression plasmid 
pcDNA3.1-U3-hTR-HDV. HDV is the hepatitis delta virus ribozyme, which was 
previously shown to increase hTR accumulation”*“*. Initially we used suspension 
HEK 293 cells lacking N-acetylglucosaminyltransferase I (GnTI)*! and transfected 
the cells using polyethylenimine (PEI)*’. These cells were cultured in FreeStyle 293 
Expression medium (ThermoFisher). After 48 h of transfection, whole cell extract 
was prepared by CHAPS lysis°*. We obtained populations of inactive and active 
telomerase RNP purified from these extracts. To enrich for the active telomer- 
ase RNP, we used adherent HEK 293T cells cultured in DMEM with GlutaMAX 
(ThermoFisher) supplemented with 10% FBS, 100 units per ml penicillin and 
100|1g/ml streptomycin. The cells were transfected with TERT and hTR expres- 
sion plasmids by calcium phosphate. After 48 h, the cells were harvested and 
resuspended in hypotonic lysis buffer (HLB) (20 mM HEPES NaOH pH 8, 2mM 
MgCh, 0.2mM EGTA, 10% glycerol, 0.1% NP40, 1mM DTT and 0.1 mM PMSF). 
Whole-cell extract was prepared by three freeze-thaw cycles and adjusted to 
300 mM before being cleared by centrifugation. The clarified extract was adjusted 
to 150mM NaCl. For ATCAB1 telomerase and TERT-hTRmin, we used a previ- 
ously characterized CAB-box mutant hTR (G414C)*> and LhTRmin”®. HEK 293 
cell lines are long-term laboratory stocks and had the expected morphology and 
growth. They have not been authenticated or tested for mycoplasma recently. 
Telomerase purification. The prepared whole-cell extract (from 120 plates of 
adherent cells or 6 | of cell suspension) was incubated with streptavidin agarose 
resin (Sigma-Aldrich) pre-bound to a 5’-biotinylated oligonucleotide’® at room 
temperature for 3h. The resin was washed with wash buffer (20 mM HEPES NaOH 
pH 8.0, 150mM NaCl, 2mM MgCh, 0.2mM EGTA, 10% glycerol, 0.1% Igepal 
CA-630, 1mM DTT) and eluted with a competitor oligonucleotide’. The eluate 
was subsequently incubated with MagStrep XT resin (IBA LifeSciences) for 2h at 
4°C. The resin was washed with wash buffer and eluted in batch with biotin elution 
buffer (100 mM Tris HCl pH 8.0, 150mM NaCl, 1mM EDTA, 5-10 mM biotin, 
2mM MgCh, 0.1% Igepal CA-630, 10% glycerol, 1 mM DTT). We collected three 
sequential batch elution fractions for which the elution buffer was incubated with 
the resin for 30 min each time. For mass spectrometry, the eluate was combined and 
precipitated using trichloroacetic acid (TCA). The pellet was washed extensively 
with acetone and resuspended in water. The sample was analysed by MudPIT to 
identify the protein components present in the sample™. For cryo-EM studies, the 
final eluate was concentrated and buffer-exchanged with cryo-EM buffer (20mM 
HEPES NaOH pH 8.0, 150mM NaCl, 2mM MgCh,, 0.05% Igepal CA-630, 1% tre- 
halose, 1mM DTT). To prepare DNA-substrate bound telomerase, oligonucleotide 
T14AG; (11M) was added in the MagStrep binding step, and excess DNA substrate 
was washed away. Substrate-bound telomerase was eluted as described above. 
Immunoblotting. The purified telomerase samples (El and E2) from the CHAPS 
lysis protocol were resolved by bis-Tris SDS-PAGE. Proteins were transferred to 
a nitrocellulose membrane and blocked with 4% nonfat milk (Carnation) in TBS 
buffer (50 mM Tris pH 7.5, 150mM NaCl) for 1h at room-temperature. The mem- 
branes were incubated with primary antibodies overnight at 4°C and washed with 
TBS before being probed with secondary antibodies (Invitrogen) in 4% nonfat milk 
in TBS for 1h at room-temperature. We blotted for TERT, TCAB1, dyskerin and 
GARI on the same membrane, and NHP2 and NOP10 on a different membrane. 
The small sizes of NHP2 and NOP10 required a shorter protein transfer time 
than the other proteins. The membranes were washed extensively with TBS and 
visualized on a LI-COR Odyssey imager. The primary antibodies used were rabbit 
anti-dyskerin (Santa Cruz Biotechnology), mouse anti-TCAB1 (Sigma-Aldrich), 
rabbit anti-GAR1 (Novus), rabbit anti- NHP2 (Proteintech), rabbit anti- NOP10 
(Abcam), mouse anti-Strep (Abcam), mouse anti-tubulin (Abcam) and rabbit IgG 
(Sigma-Aldrich). 

Direct telomerase activity assays. Direct primer extension assays were performed 
using 1 11 sample in a 20-1] reaction volume containing 50 mM Tris-acetate pH 
8.0,4mM MgCl, 5mM DTT, 2501.M dTTP, 250M dATP, 511M unlabelled 
dGTP, 0.141.M a-*?P dGTP (3,000 Ci/mmole, 10 mCi/ml, Perkin-Elmer), 500nM 
5/-(T,AG3)3-3’ primer unless primer was pre-bound. Reactions were incubated at 
30°C for 40 min and stopped with TES buffer (50mM Tris HCl pH 7.5, 20 mM 
EDTA, 0.2% SDS). The products were extracted and precipitated together with 
a 12-nt **P-end-labelled oligonucleotide as a recovery control and resolved on a 
denaturing polyacrylamide gel. The dried gels were exposed to a phosphorimager 
screen and imaged on a Typhoon Trio system (GE Healthcare). 

HCT116 cell culture and genome editing. HCT116 cells were cultured in DMEM 
with GlutaMax (ThermoFisher) supplemented with 10% FBS and 100,1g/ml 
Primocin (Invivogen). Clonal cell lines with homozygous TERT disruption were 
generated previously”*. Transgene integration at the AAVS1 safe harbour locus was 
conducted as previously described* using CAGGS promoter for TERT expression 
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and a neomycin resistance cassette for selection. Genome engineering in HCT116 
cells was performed using the manufacturer's transfection protocol (Lipofectamine 
3000, ThermoFisher) with media exchange at 24h into selection with 200,1g/ml 
G418 and 100j1g/ml hygromycin. Cells were passaged continuously with selec- 
tion medium for more than 3 months after selection for cells with the integrated 
TERT transgenes. HCT116 cells were obtained from Molecular and Cell Biology 
Department Tissue Culture Facility (UC Berkeley); this cell line was validated by 
short tandem repeat profiling with GenePrint and tested for mycoplasma by the 
Tissue Culture Facility. 

Telomeric repeat amplification assays. Cell extract was prepared using the 
HLB method as described above and normalized to 2 mg/ml, which was quan- 
tified using the Bio-Rad Bradford protein assay. Dilutions were made of the 
whole cell extract, which were used for the PCR-based telomeric repeat ampli- 
fication protocol (TRAP) as previously described°°. Two microlitres with 
400 ng total protein was used per 20,11 QTRAP reaction, consisting of 0.1 p.g/j1l 
TS primer (5’-AATCCGTCGAGCAGAGTT-3’), 0.1 pg/pl ACX primer 
(5'-GCGCGGCTTACCCTTACCCTTACCCTAACC-3’), and iTaq Universal 
Green Supermix (BioRad). Samples were incubated on a CFX96 Touch Real- 
Time PCR Detection System (Bio-Rad) at 30°C for 30 min, followed by a 
hot start of 95°C for 2 min. The samples were then cycled 35 times between 
95°C for 20s and 61°C for 90s. Relative telomerase activity was calculated 
using AC; normalized to the parental sample. Serial dilutions of 10, 30, and 
100 ng were used for the hotTRAP protocol. The TS and ACX primers were 
used to amplify telomeric repeats labelled by incorporation of «-*?P-dGTP. 
TSNT (5’-AATCCGTCGAGCAGAGTTAAAAGGCCGAGAAGCGAT-3’) 
was used as the internal control (IC) for all samples, and NT (5/- 
ATCGCTTCTCGGCCTTTT-3’) was the dedicated TSNT reverse primer. Samples 
were resolved on a native gel. The gel was dried, exposed on a phosphorimager 
screen and imaged on a Typhoon scanner (GE Healthcare). 

Southern blotting. Genomic DNA preparation was conducted as previously 
described”®. Four to eight micrograms of DNA was digested for between 8h and 
overnight with Alul and Mbol at 37°C before electrophoresis on a 0.7% agarose gel 
(Seakem ME agarose). The gel was dried and denatured in 1.5 M NaCl and 0.5M 
NaOH for 1h at 50°C. The gel was then washed twice with 4x SSC + 0.1% SDS 
before prehybridization with Church's buffer (1% BSA, 1mM EDTA, 0.5M Na3PO4 
pH 7.2, 7% SDS) for 30 min at 50°C. **P-end-labelled probes were made of the telo- 
meric repeat (T,AG3)3 as well as the O’Generuler 1 kb plus ladder (ThermoFisher). 
These probes were both added and hybridized overnight at 50°C. The gel was 
washed with 4x SSC + 0.1% SDS at 40°C before exposing on a phosphorimager 
screen and imaging on a Typhoon scanner (GE Healthcare). 

Negative-stain electron microscopy and data processing. For negative-stain grid 
preparation, 411 telomerase sample was incubated on glow-discharged 400-mesh 
copper grids, which had been previously coated with a homemade carbon film on 
nitrocellulose, for 90s and stained with 2% w/v uranyl formate. Micrographs for 
the inactive and active fractions of telomerase were collected on an FEI Tecnai 12 
transmission electron microscope operated at 120kV and equipped with a TVIPS 
TemCam F416. Data was collected using the Leginon package” and data process- 
ing was done in RELION 1.4* unless otherwise indicated. Initial particle picking 
of ~1,500 particles was performed manually, particles were extracted with a 220° 
pixel box at a pixel size of 2.18 A per pixel and subjected to reference-free 2D 
classification to obtain 2D initial references for auto-picking within RELION™. 
A total of 51,423 and 26,350 particles were picked for the inactive and active frac- 
tions, respectively. We performed two rounds of reference-free 2D classification 
for each dataset. 

Micrographs for the enriched active fraction of intact holoenzyme, ATCAB1 
telomerase, and TERT-hTRmin samples were collected on an FEI Tecnai F20 
transmission electron microscope operated at 120kV and equipped with a Gatan 
US4000 CCD camera. Data collection was done using the Leginon software 
package™. Initial particle picking for intact telomerase holoenzyme was done using 
DOG PICKER® within the APPION pipeline. This yielded approximately 16,000 
particles, which were extracted using a 400” pixel box at a pixel size of 1.5 A per 
pixel and subjected to reference-free 2D classification to obtain initial 2D class 
averages. These 2D class averages were subsequently used as references for auto- 
matic picking within RELION® to yield 15,293 and 23,922 particles for intact 
telomerase holoenzyme and ATCAB1 mutant, respectively. Reference-free 2D 
classification was again performed on these particles. For the TERT-hTRmin com- 
plex, which is a much smaller RNP than telomerase holoenzyme, DOG PICKER 
was initially used to pick approximately 48,000 particles, which were extracted 
using a 180? pixel box at a pixel size of 1.5A per pixel. 2D class averages resulting 
from reference-free 2D classification of these particles were used as references for 
automatic picking in RELION to give 50,720 particles, which were subjected to 
another round of reference-free 2D classification. A comparison of the 2D class 
averages of intact telomerase holoenzyme, ATCAB1 mutant and TERT-hTRmin 
is shown in Extended Data Fig. leh. 
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Cryo-electron microscopy. For cryo-EM analysis, 3 11 of the purified telomerase 
was applied to C-flat CF 4/2 grids (Protochips) that had been previously coated 
with a 6-nm-thick layer of homemade continuous carbon film and glow-discharged 
in amylamine®! to overcome the preferred orientation of the telomerase holoen- 
zyme on untreated carbon film. The grids were blotted for 4-5 s at 100% humidity 
and 4°C and plunged into a mixture of liquid ethane and propane cooled by liq- 
uid nitrogen™ using an FEI Vitrobot MK IV. The grids were loaded onto an FEI 
Titan Krios transmission electron microscope operated at 300kV and equipped 
with a Gatan K2 Summit direct electron detector camera mounted behind a GIF 
Quantum energy filter. Images were collected automatically using the Serial-EM 
package® in super-resolution counting mode at a calibrated magnification of 
43,477 x (physical pixel size of 1.15 A per pixel), with a total electron dose of 42 
electrons per A? during a total exposure time of 7s, dose-fractionated into 28 movie 
frames. We used a slit width of 25 eV on the energy filter and a defocus range of 
1.5-3.5um. A total of 11,654 micrographs were collected from two separate data 
collection sessions. 

Image processing. Most steps of data processing were done in RELION 2.0™ unless 
otherwise stated (Extended Data Fig. 3c). The movie stacks for each micrograph 
were corrected for drift and dose-weighted using MOTIONCOR2® running 
within Focus during data collection. Contrast transfer function (CTF) parameters 
were estimated for the drift-corrected micrographs using GCTF”. Initial particle 
picking was performed on a subset of 350 micrographs using Gautomatch (from 
K. Zhang, MRC-LMB, Cambridge) and templates from a previous dataset 
collected on substrate-free telomerase at a different magnification (data not used 
and not shown in this study), resulting in approximately 32,000 particles. These 
particles were extracted with a 400? pixel box for reference-free 2D classification 
to obtain 2D references for automatic picking of all micrographs by Gautomatch. 
A total of 1,091,177 particles from 11,450 micrographs were picked, binned by 2 
and extracted with a box size of 200? pixel. The data were initially processed in two 
batches, one for each data collection session. 3D classification into five classes was 
run for 30 iterations with an angular sampling of 7.5°, a regularization parameter 
T of 4anda60A low-pass filtered initial model obtained from the previous dataset 
of substrate-free telomerase (data not used and not shown in this study). The 
initial model used for the substrate-free telomerase was that of the published 30 A 
negative stain reconstruction of human telomerase!”. For dataset 1, a subset of 
121,116 particles from one class with the best features was selected and un-binned. 
For dataset 2, we selected two 3D classes and subjected them to one round of refer- 
ence-free 2D classification to remove more particle images showing contaminants 
or damaged telomerase complexes. A subset of 214,157 particles from this dataset 
was unbinned and combined with the subset of particles selected from dataset 1. 
The combined subset of 335,283 particles was refined and yielded a reconstruc- 
tion that was hard to interpret due to the conformational heterogeneity between 
the two lobes and within each lobe. The angular assignments resulting from this 
refinement were used for focused 3D classification with signal subtraction for each 
lobe to resolve the conformational heterogeneity within each lobe”’. The two lobes 
are henceforth referred to as the H/ACA lobe and the catalytic core, according to 
our subunit assignments using negative-stain EM (Extended Data Fig. le-h). The 
best class consisting of 103,363 particles for the catalytic lobe was refined to 7.8 A 
resolution while that for the H/ACA lobe was refined to 8.8 A resolution from 
104,330 particles. These reconstructions show clear secondary structure features as 
expected for this resolution range. To further reduce conformational heterogeneity, 
we performed another round of alignment-free 3D classification, using the align- 
ment parameters from the refinement run for each lobe, and refined the resulting 
best class. We obtained a 7.7 A resolution reconstruction from 39,860 particles for 
the catalytic core and 8.2 A resolution reconstruction from 36,210 particles for 
the H/ACA lobe. These reconstructions were used for model fitting (see below). 

We also performed alignment-free 2D classification on the original (unsub- 
tracted) particles using the orientations determined by the 3D refinement of each 
lobe to assess the relative orientation of the other lobe (Extended Data Fig. 4a). 
For both cases, the other lobe adopts a wide range of conformations relative to the 
lobe that is aligned, clearly showing that the quality of any reconstruction for the 
whole structure is limited by both the continuous movements between the two 
lobes relative to each other and the conformational heterogeneity within each lobe. 
To obtain a better reconstruction encompassing both lobes, we performed global 
3D classification for the subset of 335,283 particles before signal subtraction (25 
iterations, T= 8, angular sampling of 7.5°) (Extended Data Fig. 3c). As expected 
for a particle population exhibiting continuous motion, it resulted in four classes 
with nearly equal distribution and the two lobes adopting different conformations 
relative to each other. We refined each of these classes and obtained 10.2 A reso- 
lution for the best class. 

All refinements were performed using fully independent data half-sets 
(gold-standard refinement) and resolutions are reported based on the Fourier 
shell correlation (FSC) = 0.143 criterion (Extended Data Fig. 4b). FSC curves 
were calculated with a soft mask. For visualization, maps were corrected for the 
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modulation transfer function of the detector and sharpened by applying a negative 
B factor as determined by the post-processing function of RELION. Local resolution 
(Extended Data Fig. 4d, e) was calculated within RELION 2.0. For model versus 
map FSC curves calculations, we converted the catalytic core and H/ACA RNP 
models into density using EMAN”’. Owing to small regions of unmodeled density 
in our maps (Extended Data Fig. 4f), we prepared a soft mask (6-pixel soft edge) 
from the model and extracted the cryo-EM maps of each model using this mask. 
FSC model versus map was calculated using Xmipp”! (Extended Data Fig. 4c). 
Model fitting. Initial fitting of all X-ray structures, NMR ensembles and homology 
models was performed manually by visual inspection of the best maps of 
the individual two lobes, followed by fit optimization in Chimera”? (Extended 
Data Table 2b, Supplementary Data 1 and Supplementary Video 1). The fitted 
structure for each lobe was subsequently rigid-body fitted into the overall map 
resulting from refinement of the best 3D class from global 3D classification. A 
homology model for TCAB1 WD40 domain was obtained using Phyre”* based 
on the structure of coatomer subunit alpha in the COPI coat linkage structure” 
(PDB 5A1V) and fitted into the density located by comparisons of the 2D class 
averages for wild-type and ATCAB1 telomerase (Extended Data Fig. le-h). The 
Tetrahymena TEN domain (PDB 2B2A) and the TRBD, RT and CTE domains of 
the Tribolium TERT’? (PDB 3KYL) were fitted by rigid-body fitting into the map 
separately. We compared the fittings of the Tribolium TRBD and the medaka TRBD 
(PDB 4026) (Extended Data Fig. 7e). The former has a truncated TRBD, while 
the latter is closely related to the hman TRBD in sequence. Some of the medaka 
TRBD extensions close to the RNA binding surface fit our map, while those in the 
periphery of the domain (including residues 403-425 and residues 545-572) did 
not. We removed extensions that did not fit from the medaka TRBD and replace 
the Tribolium TRBD with this truncated version. Our substrate can form a 6-bp 
duplex with hTR template region. Thus we truncated the DNA/RNA duplex model 
in the duplex-bound Tribolium TERT (PDB 3KYL) to 6 bp. Overall, the relative 
orientations of the domains and the DNA-RNA duplex are very similar to that 
of the crystal structure. For the H/ACA RNP, the RNA used in the single-hairpin 
archaeal crystal structure (PDB 2HVY) was truncated to a 9-bp hairpin flanked 
by the 3’ ACA motif. We used this model to fit both H/ACA copies because no 
structure of H/ACA proteins bound to the H box is available. Fitted structures of 
available hTR domains include the human P2b-P3 pseudoknot** (PDB 2K95), 
P2a-J2a/b-P2b*” (PDB 2L3E), P2a.17*, P8 stem (or CR7)*? (PDB 2QH2), and the 
medaka TRBD-bound CR4/5°? (PDB 4026). The P4 and P7 stems bound by the 
H/ACA proteins were adapted from the truncated archaeal hairpin RNA model, 
as mentioned above. We used COOT” to place idealized A-form RNA helices in 
the map regions assigned to the P1, P4.2, P4.2 and Péb stems. 

Map and model visualization. Maps were visualized in Chimera” and all model 
illustrations were prepared using either Pymol (https://www.pymol.org) or 
Chimera. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. Figure source data are provided in Supplementary Fig. 1. 
Cryo-EM maps of the catalytic core, H/ACA lobe and overall human telomer- 
ase holoenzyme have been deposited with the Electron Microscopy Data Bank 
under accession numbers EMD-7518, EMD-7519 and EMD-7521, respectively. 
The PyMol session used for fitting of homology models into the EM density is 
provided as Supplementary Information. 
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Extended Data Fig. 1 | Protein identification by immunoblotting, 
enriching active telomerase, substrate pre-binding, and comparison 

of intact, ATCAB1, and TERT-hTRmin RNPs. a, Immunoblotting 

of TERT, TCAB1, dyskerin, GAR1, NHP2 and NOP10 in telomerase 
purified after CHAPS lysis protocol as shown in Fig. 1b. We used primary 
antibodies against each protein, except ZZ-SS-TERT, for which we used 
rabbit IgG. Owing to the wide range of molecular weights of the proteins 
in our sample, TERT, TCAB1, dyskerin and GARI were detected in one 
blot, while NHP2 and NOP10 were detected in a separate blot. The use 

of the same sample to probe all proteins was performed only once, but 
TERT, dyskerin and TCAB1 were also probed individually twice. b, Silver- 
stained SDS-PAGE gel of purified telomerase fractions obtained from 
adherent cells lysed using the hypotonic lysis method, which enriches 
active telomerase. This experiment was repeated more than five times with 
similar results. c, Direct primer-extension assays of the purified telomerase 
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fractions shown in b, confirming that E1 is no longer inactive (left), and 
of the substrate-bound purified telomerase fractions with additional 
DNA substrate omitted from the assays (right). The activity observed 
confirmed that purified telomerase contains the DNA substrate. The 
activity assays with substrate added were repeated over five times and the 
activity assays with substrate pre-bound were repeated twice. All repeats 
showed similar results. d, Silver-stained SDS-PAGE gel of purified intact 
and ATCAB1 telomerase and TERT-hTRmin telomerase prepared for 
subunit assignments. This experiment was done only once to provide a 
direct comparison between these different purified telomerase complexes. 
e-g, Negative-stained 2D class averages of intact and ATCAB1 telomerase 
and TERT-hTRmin, respectively. h, Comparison of representative 2D class 
averages of intact and ATCAB1 telomerase and TERT-hTRmin showing 
the inferred localization of TCAB1 and TERT. For gel source data, see 
Supplementary Fig. 1. 
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Extended Data Fig. 2 | Cellular function of the tagged TERT used 

for structural analysis. a, Western blot detection of ZZ-SS-TERT in 
TERT knockout (KO) cells rescued by untagged TERT or ZZ-SS-TERT 
expression. Whole-cell extracts were probed using Strep antibody. 
HCT116 is the parental cell line. Lysate prepared from HEK 293T cells 
transiently transfected with ZZ-SS-TERT and hTR was used as a positive 
control (Ctrl). Tubulin was detected as a loading control. This experiment 
was performed only once to confirm the success of ZZ-SS-TERT 
incorporation into the HCT116 TERT KO cells. b, Telomeric restriction 
fragment analysis of HCT116 parental cells, TERT KO cells (before 
senescence), and TERT KO cells rescued with untagged or ZZ-SS-TERT 
transgene. Transgene-expressing cells were sampled at 31, 62 and 98 days 
post-transfection with transgene vectors. This experiment was performed 
twice with similar results. c, TRAP assay detection of telomerase activity 
in HCT116 parental cells, TERT KO cells, and TERT KO cells rescued 


with untagged TERT or ZZ-SS-TERT transgene. Whole-cell extracts were 
normalized by total protein concentration and assayed at 100, 30 or 10 ng 
of total protein per reaction. IC, internal control. This experiment was 
repeated four times with similar results. d, Quantification of Q-TRAP 
assay detection of telomerase activity in HCT parental cells, TERT KO 
cells and TERT KO cells rescued with untagged TERT or ZZ-SS-TERT 
transgene. Error bars were calculated by taking the s.d. of the average 
ACt from four time points. Data points were shown as overlays. e, Direct 
primer-extension assay of telomerase after template-complementary 
oligonucleotide purification from extracts of TERT KO cells rescued by 
untagged TERT or ZZ-SS-TERT transgene. Assays were performed on 
clarified cell lysate (crude), flow-through (O-FT) and elution (OE) using 
equivalent amounts of cell extract. This experiment was performed only 
once to re-confirm the results in c. For gel source data, see Supplementary 
Fig. 1. 
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Extended Data Fig. 3 | Image processing procedures. a, Representative b, Representative 2D class averages obtained from reference-free 2D 
raw micrograph. We collected a total of 11,654 micrographs for this study. classification. c, Data processing strategy used in this study. 
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Extended Data Fig. 4 | Resolution estimation and analysis of the 
flexibility of the complex. a, Representative 2D class averages obtained 
from 2D classification without alignment of particles that were aligned on 
either the catalytic core or the H/ACA lobe. For both cases, the other lobe 
adopts a wide range of conformations, as illustrated by the blurriness of 
the density. b, FSC curves for the overall map and the maps of the catalytic 
core and H/ACA lobe resulting from focused classification with signal 
subtraction and gold-standard refinement. c, Model versus map FSC 
curves for the catalytic core and the H/ACA RNP. We fitted only homology 
models as rigid bodies into the map and did not perform model coordinate 
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refinement owing to the limited resolutions of the maps. Therefore, we 
used a lower FSC threshold of 0.25 for resolution estimates. d, e, Local 
resolution for the catalytic lobe (d) and the H/ACA lobe (e) estimated by 
RELION 2.0°4. Most of the catalytic core is resolved at 6-8 A while most of 
the H/ACA lobe is resolved at 7-9 A. f, Front (left) and back (right) views 
of the reconstruction showing modelled (grey) and unmodelled (gold) 
density. Most of the unmodelled density corresponds to single-stranded 
RNA regions or RNA bulges, and human protein extensions that cannot be 
built de novo at this resolution. 
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map. a-d, Domains of TERT. a, The TEN domain from Tetrahymena™ (top) and back (bottom) views of the 3’ hairpin set of H/ACA proteins 
(PDB 2B2A). b, The truncated medaka TRBD domain* (PDB 4026). using the same model and colour scheme as e. g, Homology model of 

c, d, The RT and CTE domains from Tribolium™ (PDB 3KYL). e, Front TCAB1 WD40 domain. h, Front (top) and bottom (bottom) views of hTR 
(top) and back (bottom) views of the 5’ hairpin set of H/ACA proteins in the catalytic core. i, hTR in the H/ACA lobe. 
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Extended Data Fig. 6 | Sequence alignment of TERT with secondary 
structure assignments based on known structures. a, Sequence 
alignment of Tetrahymena and human TEN domains. The secondary 
structure assignments of the Tetrahymena TEN domain** (PDB 2B2A) 

are shown above the aligned sequences. Regions removed before fitting 
are indicated with dashed lines below the sequences. b, Sequence 
alignment of the Tribolium, human and Tetrahymena TERT, with the latter 
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two Sacaiale truncated to match Tribolium. Secondary structure 
assignments of the Tribolium TERT are shown on top, with conserved 
motifs labelled in blue. Throughout the figure, the 7 symbol refers to a 
31o-helix. Strict 6-turns and strict a-turns are displayed as TT and TTT. 
The three catalytic aspartic acids are indicated with black arrowheads. 
ESpript was used to generate this figure”® 
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Extended Data Fig. 7 | Selected protein-protein and protein-RNA 
interactions in telomerase holoenzyme and comparisons between 
human and Tetrahymena TERT. a, Interactions between the RT and CTE 
domains of TERT and the substrate-template duplex. The RT domain is 
divided into two subdomains, the palm (green) and fingers (orange), that 
are commonly observed in retroviral reverse transcriptases. The CTE 
(cyan) is the putative thumb. The IFD insertion that is missing in the 
Tribolium TERT is indicated. b, Region of the cryo-EM reconstruction 
shown in a. Unassigned density close to the IFD insertion is highlighted in 
magenta. c, Cryo-EM density of the TEN domain in the same view as that 
in Fig. 4b. Connecting density is observed between the template region 
and the P2a.1 stem. d, Map of the CR4/5 three-way junction (wheat) 

and the nearby TERT domains highlighting the position of the P6.1 loop 
near the interface of the CTE (cyan) and TRBD (blue) domains of TERT. 
This loop was not ordered in medaka CR4/5 bound to the TRBD alone*’. 
e, Comparison of the Tribolium (left) and medaka (right) TRBD with 
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the medaka CR4/5 domain of hTR'*?, Extensions of the medaka TRBD 
that did not fit the map were truncated for visualization. f, Cryo-EM map 
with H/ACA components fitted. g, h, Detailed views of regions boxed in f 
show TCAB1 interactions with dyskerin, GAR1 and the P8 stem-loop (g), 
and interactions between the two dyskerin molecules (h), where a cluster 
of DC mutations are found (Fig. 5d). i, Comparison of the human and 
Tetrahymena TERT superposed on the RT domain. Domains of human 
TERT are coloured as in Fig. 1a, while Tetrahymena TERT is coloured grey. 
The bound human and Tetrahymena templates are coloured dark and light 
red, respectively. j, Comparison of human and Tetrahymena’? catalytic 
cores fitted into the corresponding cryo-EM maps. Domains of TERT were 
coloured as in Fig. la and TER is coloured yellow. We used the catalytic 
core and H/ACA lobe densities resulting from our focused classification/ 
refinement for the human telomerase and the overall 9.4 A Tetrahymena 
telomerase map (EMD-6442). 
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Extended Data Fig. 8 | Sequence alignments of H/ACA proteins 

with secondary structure assignments based on known structures. 

a-d, Sequence alignments of Pyrococcus furiosus (archaeal) and human 
Cbf5/dyskerin (a), GAR1 (b), NOP10 (c), and L7Ae/NHP2 (d). Secondary 
structure assignments displayed on the top are from the archaeal H/ACA 
RNP structure”* (PDB 2HVY). The 1 symbol refers to a 3yo-helix. Strict 


B-turns and strict «-turns are displayed as TT and TTT, respectively. 
Known human dyskeratosis congenita and Hoyeraal-Hreidarsson 
disease mutations®’ in H/ACA proteins are indicated with arrowheads. 
Blue arrowheads indicate residues that can be mapped onto the archaeal 
structure and black arrowheads indicate residues that were not mapped. 
ESpript was used to generate this figure’®. 
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Extended Data Table 1 | Mass spectrometry analysis of the purified telomerase sample 


Protein name Sequence count Sequence coverage (%) Length (aa) MW (KDa) 

NHP2 (H/ACA subunit 2) 2 19 153 17.2 
NOP 10 (H/ACA subunit 3) 1 17.2 64 7.7 
Nuclease-sensitive element binding protein 4 13.6 324 35.9 
Methylosome subunit pICIn 2 13.5 237 26.2 
TERT 11 10.6 1132 127 
Insulin-like growth factor 2 3 8.6 548 60.1 
TCAB1/WDR79/WRAP53beta 3 8.6 548 59.3 
squamous cell carcinoma antigen 6 8 963 110 
recognised by T-cells 3 (SART3) 

Importin subunit a-1 3 7.8 529 57.9 
Dyskerin (H/ACA subunit 4) 5 UL 514 57.7 
hnRNP A2/B1 2 7.7 313 33.9 
hnRNP A3 1 5.8 378 39.6 
Protein arginine N-methyltransferase 5 1 5.6 531 61.1 
ATP-dependent RNA helicase DHX36 2 (aol 1008 114.8 


Proteins are ranked according to their percentage of sequence coverage eliminating keratins, histones, and abundant cytoskeletal proteins. Only proteins with sequence coverages of above 5% or 
known telomerase-associated proteins (red background) with any level of detection are shown. 
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Extended Data Table 2 | Summary of cryo-EM data collection and modelling of protein and RNA subunits 


a. Cryo-EM data collection 


Data collection and 
processing 
Magnification 
Voltage (kV) 
Electron exposure (e—/A’) 
Defocus range (um) 
Pixel size (A) 
Symmetry imposed 
Initial particle images (no.) 
Final particle images (no.) 
Map resolution (A) 

FSC threshold 
Map resolution range (A) 


Catalytic Core 
(EMDB-7518) 


300 300 

42 42 

1.5-3.5 1.5-3.5 
1.15 1.15 

C1 C1 
1,091,177 1,091,177 
39,860 36,210 
7.7 8.2 

0.143 0.143 

6-9 7-9 


b. Modelling of protein and RNA subunits 


H/ACA lobe 
(EMDB-7519) 


Overall 
(EMDB-7521) 


300 


42 

1.5-3.5 
1.15 

C1 
1,091,177 
88,200 
10.2 
0.143 


Protein/RNA Total residues M.W. (KDa) Domain PDB code or modelling Reference 
TEN 2B2A (Tetrahymena model) 34 
— a ioe TRBD 4026 (medaka model) 33 
Sr 3KYL (Tribolium model) 13 
CTE 
TCAB1 548 59 B-propeller domain homology model by Phyre 
Dyskerin 514 57 
GAR1 217 21 
2HVY (archaeal model) 24 
NOP10 64 8 
NHP2 153 17 
P1 A-form double helix 
P2a.1 model from Chan et al, 2017 10, 38 
P2a-J2a/b-P2b 2L3E or 
template 3KYL 13 
P3 2K95 36 
ATR 451 138 P4 2HVY as part of the H/ACA RNP 24 
P4.1 A-form double helix 
P4.2 A-form double helix 
P5-P6.1-P6a 4026 (medaka model) 33 
P6b A-form double helix 
P7 2HVY as part of the H/ACA RNP 24 
P8 2QH2 39 
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Structural basis for dual-mode 
inhibition of the ABC transporter MsbA 


Hoangdung Ho!, Anh Miu*’, Mary Kate Alexander*”, Natalie K. Garcia+?, Angela Oh!, Inna Zilberleyb®, Mike Reichelt®, 
Cary D. Austin®, Christine Tam®, Stephanie Shriver>, Huiyong Hu’, Sharada S. Labadie’, Jun Liang’, Lan Wang’, Jian Wang®, 
Yan Lu®, Hans E. Purkey’, John Quinn’, Yvonne Franke?, Kevin Clark’, Maureen H. Beresini?, Man-Wah Tan’, 
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The movement of core-lipopolysaccharide across the inner membrane of Gram-negative bacteria is catalysed by an 
essential ATP-binding cassette transporter, MsbA. Recent structures of MsbA and related transporters have provided 
insights into the molecular basis of active lipid transport; however, structural information about their pharmacological 
modulation remains limited. Here we report the 2.9 A resolution structure of MsbA in complex with G907, a selective small- 
molecule antagonist with bactericidal activity, revealing an unprecedented mechanism of ABC transporter inhibition. 
G907 traps MsbA in an inward- facing, lipopolysaccharide-bound conformation by wedging into an architecturally 
conserved transmembrane pocket. A second allosteric mechanism of antagonism occurs through structural and functional 
uncoupling of the nucleotide-binding domains. This study establishes a framework for the selective modulation of ABC 
transporters and provides rational avenues for the design of new antibiotics and other therapeutics targeting this protein 


family. 


An obstacle in addressing the need for new antibiotics targeting 
Gram-negative bacteria is the requirement that these molecules 
penetrate the outer membrane’”. A modular glycolipid known as 
lipopolysaccharide (LPS) resides exclusively within the outer mem- 
brane and provides a highly impermeable barrier®. A specialized 
transport system moves LPS to the cell surface, but LPS biogenesis 
occurs in the cytoplasm**. There, oligosaccharides are ligated to a 
lipid A membrane anchor, forming core-LPS*. Because LPS cannot 
spontaneously exchange across the inner membrane at a physio- 
logically relevant rate, MsbA, an essential ATP-binding cassette 
(ABC) transporter, must carry out its vectorial transport®’. MsbA isa 
homodimeric member of the B-family of ABC transporters, in which 
ATP hydrolysis by the intracellular nucleotide-binding domains 
(NBDs) powers unidirectional LPS transport®. A recent framework 
provided by cryo-electron microscopy (cryo-EM) supports the exis- 
tence of inward-facing and outward-facing conformations of MsbA 
associated with a ‘trap and flip’ transport mechanism’. Notably, loss 
of MsbA causes rapid membrane perturbation and cell death’, iden- 
tifying it as a potential antibacterial target. 

The discovery of potent and selective modulators of ABC trans- 
porters remains challenging, limiting our mechanistic understanding 
of their pharmacological regulation. This is exemplified by MsbA, a 
largely unexplored antibacterial target that offers a potential new strat- 
egy to overcome existing antibiotic-resistance mechanisms. We set out 
to identify selective MsbA antagonists and found a quinoline class of 
molecules with bactericidal activity. Structural studies revealed that 
these inhibitors target a previously unknown allosteric transmembrane 
site to trap MsbA in an inward-facing, LPS-bound state that prevents 
transition to the outward-facing conformation. Upon inhibitor binding, 
the NBDs also become uncoupled and adopt an asymmetric confor- 
mation not previously observed in other ABC transporter structures. 


Overall, this study presents a structural template for selective small 
molecule inhibition within the ABC transporter superfamily. 


Discovery of selective MsbA inhibitors 

G592 is a quinoline compound with potent activity on purified 
Escherichia coli MsbA (EcMsbA) that was identified from a library 
of approximately 3,000,000 small molecules and specifically inhibits 
the activity of MsbA in cells (Fig. 1a, b, Supplementary Information 
Section 1 and Extended Data Figs. 1, 2). Inhibition was enantiomer- 
selective as the (R)-(—) form, G593, displayed almost tenfold lower 
activity than G592. Efforts aimed at improving the potency of G592 
led to the discovery of G247 and G907 (Fig. 1a, b; G247 half-maximum 
inhibitory concentration (IC59) = 5 nM, G907 ICs59 = 18 nM). G247 
and G907 treatment mirrored the distinctive membrane elaboration 
phenotype of E. coli cells depleted of MsbA’ (Fig. 1c and Extended 
Data Fig. 2) and inhibited the growth of wild-type E. coli and Klebsiella 
pneumoniae, with a minimum inhibitory concentration (MIC) of 
5.6 1g ml~!. As these quinolines represent the first potent and selective 
inhibitors of MsbA, we sought to understand the molecular basis for 
their antagonism through structural studies. 


Structure of MsbA in complex with G907 

Obtaining high-resolution structural information for MsbA has proven 
challenging"’, and recent cryo-EM structures did not achieve the reso- 
lutions necessary to support structure-based inhibitor design. We 
obtained crystals of ECMsbA in the presence of quinoline inhibitors 
that diffracted X-rays to 2.9 A resolution when a facial amphiphile was 
used’! (Extended Data Table 1). In complex with G907, MsbA adopts 
an inward-facing conformation in which a membrane-exposed pocket 
is formed between transmembrane helices TM4, TM5 and TM6 of each 
subunit (Fig. 2a—e). G907 is wedged into this crevice in each MsbA 
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Fig. 1 | Characterization of selective MsbA inhibitors. a, Inhibitors 
described in this study. b, Dose-response curves of purified E. coli MsbA 
in amphipols. Data are mean + s.e.m. from three independent experiments 
in all cases except for G593 where data are from two experiments. ICs 
values (shown in parentheses) were determined by fitting inhibition 
dose-response curves with a nonlinear four-parameter inhibition model 
(see Methods section ‘MsbA ATPase assay’). c, Representative thin 
section electron micrographs comparing msbA*, msbA~ and inhibitor- 
exposed cells, in which imp indicates use of the E. coli CFT073 UPEC 
IptD(imp4213) strain (Supplementary Table 1). Images are representative 
of 20 isolated cells from two independent experiments. Scale bars, 0.1 pm. 


subunit, halfway across the transmembrane region (Fig. 2a). There is 
no obvious access pathway to this site from the cytoplasm or inner 
vestibule, so it appears that G907 gains access to MsbA through the bulk 
membrane. EcMsbA crystallized in the presence of a second quinoline 
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inhibitor, G092, revealed a similar structure despite different lattice 
packing (Extended Data Fig. 3a—h and Extended Data Table 1). 

The 2-chloro-6-cyclopropylpheny] substituent of G907 (A-ring) 
exhibits strong electron density (Fig. 2b) and makes van der Waals 
interactions with side chains from TM4 (L171, A175 and V178), TM5 
(A259 and L263) and TM6 (M291 and L294) (Fig. 2d, e and Extended 
Data Fig. 4). The quinoline core of G907 (B-ring) is orthogonal to the 
plane of the phenyl substituent (Fig. 2d), where it is partially enclosed 
by residues from TM4 (V178, S179 and 1182), TM5 (A259) and TM6 
(M295 and L298) (Fig. 2d, e). A259 and M295 side chains contact the 
4-cyclopropyl substitution of the quinoline core and delineate the 
inhibitor binding-pocket from the inner vestibule of MsbA (Fig. 2d, e). 
The acrylic acid substitution forms an anchoring salt bridge with R190 
(TM5) and points towards the inner vestibule, where the trans olefin 
linker is surrounded by TM4 (1182), TM5 (1255) and TM6 (K299) 
(Fig. 2d). The tight fit of G907 (and G092) observed within the bind- 
ing pocket explains enantiomeric selectivity and rationalizes on-target 
resistance mechanisms for this new class of antibacterial compounds, in 
which two isolated resistant mutants, A175V and I182M, would steri- 
cally interfere with G907 binding (Fig. 2c, Extended Data Fig. 2d, e 
and Supplementary Information Section 1). As treatment of E. coli with 
quinoline inhibitors causes target-mediated cell killing (Fig. 1c), the 
G907-LPS-EcMsbA complex probably represents the physiologically 
relevant inhibitor-MsbA complex in vivo and reveals a previously 
unknown modulatory site within a B-family ABC transporter. 


State-dependent inhibition of MsbA by G907 

The outward-facing structure of Salmonella typhimurium MsbA 
(StMsbA)!°, and the recently determined inward-facing structure 
of EcMsbA in complex with LPS’ (LPS—EcMsbA-cryo), allow direct 
comparison of the G907-binding site under distinct conformational 
states (Fig. 3a). In the outward-facing StMsbA structure, the G907 
pocket is markedly deformed and expected to perturb inhibitor bind- 
ing (Fig. 3b). This substantial perturbation of the binding site during 
the transport cycle defines G907 as an inward-facing state-dependent 
inhibitor. Further comparison with the LPS-EcMsbA-cryo structure 
suggests an induced fit, whereby G907 binding is accommodated by 
a shift of TM4 by approximately 4 A to expose the full binding pocket 
(Fig. 3c). The presence of G907 causes strain along TM4, as evidenced 
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Fig. 2 | The G907-LPS-EcMsbA complex reveals a transmembrane 
inhibitor binding site. a, Inward-facing conformation of G907-LPS- 
EcMsbA complex with membrane boundaries, G907 (yellow) and LPS 
(green) indicated. b, 2F, — F. (10, green) and F, — F. (20, pink) maps 
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calculated before modelling G907, carved at 5 A. c, On-target resistance 
mutations: A175V (pink) and 1182M (purple). d, e, Alternate inhibitor 
binding-site views coloured from blue (TM3) to green (TM6). 
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Fig. 3 | Induced fit and state-dependent inhibition of MsbA by G907. 
a, Matched perspectives of G907-LPS-EcMsbA (this study), inhibitor- 
free LPS-EcMsbA-cryo (Protein Data Bank (PDB) accession 5T V4), and 
outward-facing St{MsbA (PDB accession 3B60) complexes; LPS has been 
omitted for clarity. b, Structural superposition reveals deformation of the 
binding site in the St{MsbA outward-facing state, where TM5 and TM6 
clash with G907. c, d, Superposition with the inhibitor-free LPS-EcMsbA- 
cryo structure (pink, cryo-TM4) indicates that part of TM4 translates 
(blue, G907-TM4a) to form the G907 binding site. TM4 also unwinds 
(asterisk, G907-TM4b) and R190 forms a salt bridge with the acrylic-acid 
substitution of G907 (d, inset). e, Superposition highlights structural 
differences propagated towards the NBDs, relative to the inhibitor-free 
LPS-EcMsbA-cryo structure; NBDs and subunit B have been removed 
for clarity. 


by alocal unwinding of the TM4 helix below the binding site (Fig. 3d). 
As a consequence, the R190 side chain becomes available to form an 
anchoring salt-bridge interaction with the acrylic acid substitution of 
G907 (Fig. 3d, inset). Overall, these observations are consistent with 
G907 acting as a selective inhibitor of an inward-facing state of MsbA 
(Fig. 3b, c). As LPS co-purifies with MsbA (Extended Data Fig. 1i), 
and G907 is not competitive with LPS (Fig. 2a), G907 appears to act 
by impeding the structural rearrangements necessary to transition 


198 | NATURE | VOL 557 | 10 MAY 2018 


LPS-MsbA to an outward facing conformation (Fig. 3a—c). Notably, 
G907 targets a site that is critical for mechanical coupling during 
the transport cycle!” and architecturally conserved in B-family ABC 
transporters!? (Extended Data Fig. 5). 


Allosteric inhibition of MsbA by G907 

The G907-LPS-EcMsbA structure reveals an unanticipated second 
mechanism of small-molecule antagonism whereby the two-fold sym- 
metry within the transmembrane region breaks down at the NBDs 
(Fig. 4a—c). Comparison of pseudosymmetric mock MsbA homo- 
dimers generated from the G907-LPS-EcMsbA structure reveals that 
the NBDs in the A’-A model are separated by approximately 10 A 
(Fig. 4b), whereas a severe clash between the NBDs occurs in the B-B’ 
model (Fig. 4c). This rationalizes why the NBDs in the G907-MsbA 
crystal structure have adopted an asymmetric conformation: they 
cannot simultaneously occupy the same physical space. Structurally, 
coupling helices 1 and 2 (CH1 and CH2) from subunit A have become 
unlatched from a highly conserved CH1-CH2-NBD interaction net- 
work linking the transmembrane domains to the NBDs, as first defined 
in Caenorhabditis elegans P-gp’? (CePgp; Fig. 4d). The pronounced 
asymmetric arrangement of NBDs in the G907-LPS-EcMsbA com- 
plex is expected to be incompetent for ATP hydrolysis (Fig. 4a—d and 
Extended Data Fig. 6), consistent with the potent inhibition of ATPase 
activity observed in vitro (Fig. 1b). NBD uncoupling was also observed 
in solution, as revealed by hydroxyl-radical footprinting-mass spectro- 
metry (HRF-MS)". Relative to the absence of inhibitor, MsbA foot- 
printing analysis in the presence of G907 and G092 revealed a marked 
increase in the solvent exposure of CH1 and CH2 (Fig. 4e, Extended 
Data Fig. 7 and Extended Data Table 2). 

Although functional asymmetry has been reported for transpor- 
ters with non-equivalent NBDs!>~!8, the structural asymmetry in the 
G907-LPS-EcMsbA complex is, to our knowledge, unlike that of any 
other ABC transporter structure yet determined”!*!>? (Fig. 4a-d 
and Extended Data Fig. 6). The profound uncoupling of the NBD 
machinery in EcMsbA suggests that G907 stabilizes a catalytically 
incompetent state of the transporter. Comparison of related B-family 
ABC transporter structures”! reveals that there is a unique displace- 
ment of one NBD by 10-15 A towards the membrane normal in the 
G907-LPS-EcMsbA complex (Extended Data Fig. 6). The molecular 
events leading to this NBD displacement originate approximately 50 A 
away at the inhibitor-binding site. In comparison to the LPS-EcMsbA- 
cryo structure’, inhibitor binding induces movement and strain along 
TM4 that, through an inward scissoring motion of the TM2-TM3 and 
TM4-TMS helices, forces the NBDs together (Fig. 3d, e). Concomitant 
release of one of the CH1-CH2-NBD interaction networks within the 
MsbA homodimer is necessary to avoid direct steric clashes between 
the NBDs (Fig. 4a—d). We conclude that G907 functions as an allosteric 
dual-mode inhibitor because NBD uncoupling occurs approximately 
50A away from the primary membrane-embedded binding sites 
(Fig. 2a). 


MsbDA uses a hydrocarbon ruler to measure LPS 

Until recently, the molecular basis of lipid recognition by MsbA was 
unknown. Compared to the LPS—EcMsbA-cryo structure’, a more com- 
plete LPS molecule is delineated in the 2.9 A G907-LPS-EcMsbA and 
G092-LPS-EcMsbA crystal structures (Fig. 5a, b and Extended Data 
Fig. 8). Although unambiguous placement of the core sugar moieties 
is challenging, the characteristic phosphoglucosamine substituents of 
the lipid A moiety are well-defined and surrounded by a coordination 
network of arginine side chains arising from the two-fold symmetric 
transmembrane architecture of the MsbA transporter, approximately 
halfway across the membrane bilayer (Fig. 5a—c and Extended Data 
Fig. 8). All six acyl chains of LPS have now been resolved, allowing 
direct visualization of densely packed hydrocarbon chains that point 
towards the closed periplasmic gate of MsbA (Fig. 5a—c and Extended 
Data Fig. 8). The G092-LPS-EcMsbA ternary complex reveals that 
the inner vestibule of ECMsbA forms a thimble-like structure that is 
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Fig. 4 | Asymmetric uncoupling of the NBDs in MsbA by G907. 

a, NBDs of G907-LPS-EcMsbA (subunit A, blue; subunit B, grey). 

b, c, Pseudosymmetric A’—A (b) and B-B’ (c) models built by 
superimposing the transmembrane domains. d, NBD-CH1-CH2 
coupling interface. EcMsbA residues D117 (yellow), K215 (orange) and 
¥393 (green) are highly conserved amongst B-family ABC exporters, as 
in CePgp (PDB accession 4F4C), in which these residues typically form 


expected to function as a hydrocarbon ruler to select for the 12-carbon 
and 14-carbon chains characteristic of LPS (Fig. 5a and Extended Data 
Fig. 8). As it would be less favourable to constrain the longer 18-carbon 
chains of a typical glycerophospholipid between the charged arginine 
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coupling interactions (< 3.5 A). e, Solution-based HRF-MS performed 
on LPS-EcMsbA with or without G907. Tryptic peptide oxidation of 
inhibitor-bound sample relative to inhibitor-free sample is shown. Data 
are mean + s.e.m. from three replicate experiments. NT, N terminus; ID1, 
ID2 and ID3 refer to the intracellular (cytoplasmic) CH1, CH2 and NBD 
regions of MsbA, respectively. 


coordination ring and the closed periplasmic gate (Fig. 5a), ECMsbA 
targets this second distinguishing feature of lipid A to achieve substrate 
selectivity over competing bulk membrane phospholipids, in addition 
to securing the bivalent phosphoglucosamine headgroup. 


f outward facing, LPS bound: 9 Outward occluded, apo: 
acyl-chain exposure to H,O hydrophobic partitioning 
and opening of lateral gate of LPS 


Fig. 5 | LPS selectivity and transport by MsbA. a, G092-LPS-EcMsbA 
complex highlighting LPS (yellow) and arginine and lysine residues 
(green) involved in lipid A coordination; G092 omitted. Bulk membrane 
is depicted on the basis of a molecular dynamics simulation. b, Fy — F. 
maps calculated with LPS omitted (1.0c, cyan; 5.00, pink), carved 

at 5A. c, Basic residues (green) form a selectivity filter around the 


phospho-N-acetylglucosamine (P-GIcN) moieties of lipid A, halfway 
across the membrane (see a for perspective). d~g, LPS transport model; 
see text for details. d, E. coli MsbA (PDB accession 3B5W); e, G907- 
LPS-EcMsbA as a proxy for the LPS-bound inward-facing state; f, StMsbA 
(PDB accession 3B60); g, ADP-vanadate-EcMsbA-cryo (PDB accession 
5TTP). 
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Implications for MsbA and ABC transporters 

The dual-mode inhibitory mechanism of G907 uncovered here may 
have broad but unrecognized relevance across the ABC transporter 
superfamily. A pocket analogous to the G907-binding site has been 
observed in related ABC transporters!> (Extended Data Fig. 5), so it 
is tempting to speculate that this site may be an allosteric modulatory 
hotspot for natural ligands, lipids, protein partners or designed small 
molecules. Furthermore, although the site and mechanisms by which 
G907 antagonizes MsbA are clearly distinct, it is noteworthy that the 
antagonist glibenclamide also wedges between transmembrane heli- 
ces in the ABC transporter-like protein SUR1”°, and that a nanobody 
inhibitor of PglK sterically impairs NBD closure, ATP hydrolysis and 
transition to an outward-facing conformation’. The SUR1 and PglK 
structures therefore independently highlight two emerging modula- 
tory concepts discovered for the dual-mode quinoline series inhibitors 
targeting MsbA: transmembrane-helical interdigitation and functional 
NBD uncoupling. 

The structure of LPS in complex with EcMsbA presented herein 
helps to reconcile evolutionarily conserved features of the LPS biosyn- 
thesis pathway*. Overexpression of msbA suppresses mutations in the 
htrB lipid A acyltransferase, as MsbA can transport the tetra-acylated 
LPS produced in htrB mutants, albeit inefficiently®. The tight packing 
of hexa-acylated LPS observed in our crystal structures (Fig. 5a, b and 
Extended Data Fig. 8) suggests that the binding of acyl chains into the 
inner vestibule of MsbA provides an element of substrate selectivity, 
rationalizing why tetra-acylated substrates are less preferred. MsbA 
cannot, however, efficiently transport a substrate lacking phosphory- 
lation at the 4’-position of lipid A. This last essential step of lipid A 
biosynthesis is catalysed by LpxK**”*. Notably, the IpxK gene is found 
immediately downstream of msDA in E. coli, and the two proteins are 
concatenated in some bacteria, suggesting tight coupling of their cel- 
lular activities. MsbA has evolved to directly target this unique feature 
of LPS as a primary basis for lipid selectivity (Fig. 5c). Although the 
overall features revealed by our structures do not rule out the possibility 
that other lipids are transported by MsbA, the multi-point coordination 
of the bivalent lipid A phosphoglucosamine moieties by an arginine 
selectivity filter and the snug fit of the short acyl chains within the 
inner vestibule are features that are consistent with a preference for 
selective LPS transport. 

The G907-LPS-EcMsbA and G092-LPS-EcMsbA complexes, in com- 
bination with structural information from other ABC transporters”, 
define an alternating access mechanism of LPS transport that is 
summarized in Fig. 5d-g. An open inward-facing conformation of 
MsbA must exist that promotes lateral ferrying of the massive LPS 
substrate from the membrane inner leaflet into the vestibule of MsbA 
(Fig. 5d, e). Upon binding, MsbA completely engulfs its LPS substrate 
(Fig. 5a, e), in contrast to other lipid transporters that leave the acyl 
chains of their substrates within the bulk bilayer'***”°. Preferential 
binding to the arginine coordination ring shuttles LPS halfway across 
the bilayer and shields its polar headgroup (Fig. 5e), where an intrinsic 
hydrocarbon ruler measures acyl chain length against a closed peri- 
plasmic gate to further ensure selective substrate engagement. In the 
absence of a G907-like inhibitor, ATP binding promotes NBD closure 
and an outward-facing conformation (Fig. 5f). Upon opening of the 
periplasmic gate (Fig. 5f, g), acyl-chain exposure to the aqueous envi- 
ronment and concomitant opening of a lateral gate provide a feasible 
pathway for LPS to exit MsbA and become flipped into the outer leaf- 
let of the bilayer (Fig. 5f, g). LPS discharge is probably also aided by 
ATP hydrolysis’, and after the release of inorganic phosphate and ADP 
from the NBDs, MsbA recycles back to an inward-facing conformation 
(Fig. 5d). 

The identification of novel dual-mode MsbA antagonists has 
allowed us to visualize unprecedented structural features of ABC 
transporter inhibition and lipid substrate selection. G907 and related 
quinoline antagonists do not inhibit MsbA by competing directly 
with the transported substrate, as is typical for other ABC trans- 
porter inhibitors. Unexpectedly, binding of inhibitors such as G907 
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into a transmembrane-exposed pocket promotes a cascade of molec- 
ular events that are propagated along TM4 and ultimately result in 
the unhinging of a conserved interaction network and subsequent 
functional and structural uncoupling of the NBDs. Considering the 
architectural conservation of the G907-binding pocket in other ABC 
transporters, this work should enable the discovery of selective antago- 
nists across the broader ABC transporter superfamily. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0083-5. 
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METHODS 


Protein expression and purification. The wild-type MsbA transporter from E. coli 
was cloned into a modified pET-52b expression vector and overexpressed using 
E. coli host Rosetta 2 (DE3) grown by fermentation at 17°C for 64h. Cells express- 
ing E. coli MsbA were collected and resuspended in 50 mM Tris, pH 8.0, 500 mM 
NaCl (Buffer A) supplemented with cOmplete Protease Inhibitors (Roche), 1 mM 
PMSF and 2 U ml"! of Benzonase nuclease (Sigma Aldrich). After cell lysis by 
microfluidization, FA-3 detergent (for structural studies and some biochemi- 
cal assays, as indicated; Avanti Polar Lipids) or lauryl maltose neopentyl glycol 
(LMNG, for biochemical assays; Anatrace) was added to 1% (w/v) and protein 
solubilization was carried out with gentle agitation at 4°C overnight. After centri- 
fugation at 185,000g for 1h, clarified supernatant was mixed gently with anti-Flag 
M2-agarose resin (Sigma Aldrich) pre-equilibrated with buffer B (buffer A sup- 
plemented with 0.06% (w/v) FA-3 or 0.005% LMNG (w/v)) for 1-2h at 4°C. Flag 
resin was collected by gravity flow, washed with ten column volumes of buffer B, 
and eluted with four column volumes of buffer B supplemented with 0.15 mg ml“! 
Flag peptide. The Flag-tag was removed using TEV protease and the cleaved E. coli 
MsbA was passed over a Superdex 200 column (GE Healthcare) in 20 mM Tris pH 8, 
100 mM NaCl and 0.06% (w/v) FA-3 or 0.005% LMNG (w/v). The peak frac- 
tions containing E. coli MsbA were pooled and concentrated to 10mg ml! using 
Vivaspin centrifugal devices (50 K MWKO). Protein aliquots were flash frozen 
and stored at —80°C. 

MsbA reconstitution into amphipols. Where the use of amphipol-incorporated 
MsbA is indicated, the purified MsbA in LMNG was adjusted to 1mg ml“! and 
reconstituted into A8-35 amphipols”°(Anatrace) by addition of an equal volume 
amphipol stock (20 mgml~!). The sample was incubated overnight at 4°C and 
resolved again over a Superdex 200 column (GE Healthcare) in 100 mM NaCl, 
20 mM Tris pH 8.0. The peak fractions containing E. coli MsbA were pooled and 
concentrated to 1mg ml! using Vivaspin centrifugal devices (50 K MWKO). 
Protein aliquots were flash frozen and stored at —80°C. 

Quantitation of co-purifying LPS. The LPS content of purified E. coli MsbA in 
amphipol and FA-3 incorporated MsbA was measured using a Limulus amoebocyte 
lysate chromogenic endotoxin quantitation assay, according to the manufacturer's 
instructions (Pierce). A standard curve was generated using LPS from E. coli strain 
O111:B4, as directed by the manufacturer. One endotoxin unit (EU) was assumed 
to be equal 0.1 ng of LPS. 

Reconstitution into nanodiscs. The process of E. coli MsbA reconstitution 
into nanodiscs”’ was initiated by mixing LMNG-solubilized MsbA protein with 
purified recombinant membrane scaffold protein (MSP) and sodium cholate- 
solubilized phospholipids 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphoethanolamine 
(POPE):phosphatidylglycerol (POPG) (9:1 w/w ratio). The sample mixture 
was dialysed into 100 mM NaCl, 20 mM Tris, pH 8.0 overnight at room temperature. 
E. coli MsbA nanodiscs were separated from protein aggregates and free MSP 
by resolving over a Superdex 200 column (GE Healthcare). Fractions containing 
E. coli MsbA nanodiscs were pooled, concentrated to 1 mg ml! and stored at 
—80°C. 

MsbA ATPase assay. The ATPase activity of MsbA was measured using a 
Tanscreener ADP2 FP Assay (BellBrook Labs). To determine the ICs» of MsbA 
inhibitors, compounds were incubated with 2 MsbA enzyme solution for 10 min 
and then 2 ATP solution was added to initiate the ATPase reaction. The final 
condition of the ATPase reaction was 3nM MsbaA (E. coli or Klebsiella, as indicated) 
in amphipol, 50 1M ATP in 50 mM Tris pH 7.5, 10mM MgCh, 1% glycerol, 0.1% 
bovine gamma globulin, 1 mM dithiothreitol (DTT), 0.007% Brij-35 and 0.5% 
dimethylsulfoxide (DMSO). The A175V MsbA mutant was screened at a final 
concentration of 9nM. The reaction was quenched by the addition of Transcreener 
Detection Buffer when about 5% of the ATP was hydrolysed to ADP. The ICs 
was determined by fitting the inhibition dose-response curve with a nonlinear 
four-parameter inhibition model (GraphPad Prism). The same assay conditions 
were used in the uHTS campaign (using E. coli MsbA in nanodiscs). 

Bacterial strains. Most E. coli strains used in this study were constructed in 
CFT073, an uropathogenic E. coli (UPEC) strain (Supplementary Table 1). Strain 
identity was confirmed by whole genome sequencing. For MsbA overexpression, 
the msbA open reading frame (ORF) was cloned into the EcoRI/Xbal sites of the 
arabinose-inducible, high-copy expression vector pBAD24 using PCR primers 
msbA-f2 and msbA-r2 (Supplementary Table 2). Overexpression of the MsbA pro- 
tein was confirmed by western blotting using polyclonal antibodies (1:2,000 sera 
dilution) raised against purified MsbA protein (Extended Data Fig. 2a). 

A conditional MsbA knockout (msbA-cond-ko) strain was constructed by 
subcloning the arabinose-inducible msbA from pBAD24 into the Sacl site of 
the integration vector pLDR9”* using PCR primers PBAD-msbA-f and PBAD- 
msbA-r (Supplementary Table 2). The NotI-digested and religated construct was 
integrated into the lambda att site using lambda integrase”*. The endogenous msbA 
gene was then replaced with a promoterless kanamycin marker flanked by FRT 
sites, amplified from pKD4 using primers msbA-KO-noPkan-f and msbA-KO- 


kan-r (Supplementary Table 2) and integrated into CFT073 by Red recombinase- 
mediated homologous recombination”’. 

To make strains with a permeablized outer membrane, the endogenous /ptD 
gene of the wild-type CFT073 or CFT073 msbA-cond-ko strain was replaced with 
an [ptD(imp4213) mutant allele*’. A construct was synthesized (GenScript) with 
200 bp of the /ptD ORF just upstream of the [ptD(imp4213) deletion followed by 
the remaining 1,299 bp of the jptD ORF codon changed to keep the same amino 
acid sequence but prevent recombination during integration, a promoterless 
kanamycin resistance marker flanked by FRT sites, and 200 bp homologous to 
the chromosomal sequence just downstream of the jptD ORE. The construct was 
integrated onto the chromosome by Red recombinase-mediated homologous 
recombination”®. 

An efflux-deficient mutant of the E. coli K12 strain MG1655 (ATCC 700926), 
MG1655 AtolC, was made by transducing the AtolC::kanR allele from the Keio 
collection strain JW5503*! using P1vir phage (E. coli Genetic Stock Center 12133). 
Cellular ICs) and MIC determinations. Two-hundred-nanolitre aliquots of eight 
twofold dilutions of each compound were dispensed in duplicate into 384-well 
plates using an Echo Liquid Handler (Labcyte). Strains to be tested were grown in 
M9 medium supplemented with 1% maltose and, for the overexpression strains, 
0.2% arabinose overnight for 16h, washed with PBS, resuspended at OD¢o0 = 1, 
and frozen in 100-1 aliquots for use in screening. For each experiment, aliquots 
were thawed on ice and diluted 1:1,000 into M9 1% maltose, plus 0.2% arabinose 
for overexpression strains. Ten microlitres of cells per well was dispensed into 
384-well plates containing compound using a Multidrop Combi Reagent Dispenser 
(Thermo). Plates were incubated for 6h at 37°C, then 10 jl BacTiter-Glo reagent 
(Promega) was added to each well and luminescence was measured. ICs9 values 
were determined by fitting the inhibition dose-response curve with a nonlinear 
four-parameter inhibition model (GraphPad Prism). Reported values are the stand- 
ard geometric means of >2 independent values. MICs were determined in cation- 
adjusted Mueller-Hinton broth with 0.002% Tween-80 following the standard 
Clinical and Laboratory Standards Institute protocol”. 

Electron microscopy. To prepare cells for electron microscopy, an overnight 
culture of CFT073 IptD(imp4213) grown in LB was diluted 1:100 in LB; in addi- 
tion an overnight culture of CFT073 lptD(imp4213) msbA-cond-ko, which was 
grown in LB 2% arabinose to induce MsbA expression to wild-type levels, was 
diluted 1:100 in LB 2% arabinose. Both cultures were grown to log phase for 2.5h 
at 37°C, then pelleted, washed twice with PBS and resuspended at OD¢o9 = 0.1 in 
LB. Fifty microlitres of 10mM G03047592 or 50 sl DMSO (no drug control) was 
added to 5 ml CFT073 [ptD(imp4213); 1% glucose was added to a 5-ml culture 
of CFT073 IptD(imp4213) msbA-cond-ko to fully repress MsbA expression. All 
cultures were incubated for 3h at 37°C. Cells were pelleted, washed with PBS, fixed 
with 100 jl Karnovsky’s fixative (2% paraformaldehyde and 2.5% glutaraldehyde 
in 0.1 M sodium cacodylate buffer, pH 7.2), post-fixed in 1% reduced osmium 
tetroxide (EM Sciences) for 1h and then incubated in 0.5% uranyl acetate for 1h. 
The samples were dehydrated through an ascending series of ethanol (50%, 70%, 
90%, 100%), then washed with propylene oxide and finally embedded in Eponate 
12 (Ted Pella). Ultrathin sections (80 nm) were cut with an Ultracut microtome 
(Leica), stained with 0.2% lead citrate and examined in a JEOL JEM-1400 trans- 
mission electron microscope at 80kV. Digital images were captured with aGATAN 
Ultrascan 1000 CCD camera. 

Isolation of resistant mutants and whole genome sequencing. Resistant isolates 
to G592 were obtained by plating MG1655 AfolC on LB agar with 1.8 pg ml! G592 
(2 x MIC). Growth of MG1655 AfolC at 4 x MIC gave rise to no viable colonies. 
Genomic DNA was isolated from 2 ml of overnight culture of the resistant mutants 
using a DNEasy Blood and Tissue Kit (Qiagen). Genomic DNA was sheared 
between 350 and 500 bp using a Covaris LE220. The libraries were prepared using 
a TruSeq Nano library prep kit from Illumina (FC-121-4001). Sequencing was 
carried out in the Illumina Miseq (1x 150 cycles). An average of ~21 million 
2 x 75-bp paired reads were obtained per sample. All fastq files were then filtered 
for read quality, keeping reads in which at least 70% of the cycles had Phred scores 
> 23. Filtered reads were aligned to the E. coli K-12 MG1655 reference genome 
(NCBI Reference Sequence accession number NC_000913) using GSNAP version 
2013-10-10°* and variant analysis was performed using the Bioconductor package 
VariantTools**. To increase confidence in variant calls, base quality scores of 30 
or higher were required. 

Protein crystallization. Inhibitor G907 (or G092) was added to E. coli MsbA at 
10 mgm!" in FA-3 to a final concentration of 1 mM and incubated for 1h on ice. 
The G907-LPS-EcMsbA (or G092-LPS-EcMsbA) complex was crystallized at 
19°C using sitting-drop vapour diffusion by mixing the complex with mother 
liquor (200 mM NaCl, 19% PEG 550 MME, 4% PEG 400, 100 mM HEPES, pH 7.0) 
at a 1:1 (v/v) ratio. Crystals reached full size after 1 week and were then flash frozen 
in cryoprotectant containing crystallization buffer plus additional 20% PEG 400. 
Heavy atom labelling for crystallographic phasing. E. coli MsbA purified in FA-3 
detergent was adjusted to 1 mgml~!. Ethyl mercurithiosalicyclic sodium salt (Hg) 
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was added to the final concentration of 1 mM and the mixture was incubated 
overnight at 4°C. Free mercury was removed from Hg-labelled MsbA protein by 
passing the sample onto a Superdex 200 column in 100mM NaCl, 0.06% FA-3, 
20 mM Tris, pH 8.0. The fractions containing Hg-labelled MsbA were pooled and 
concentrated to 10mg ml! using Vivaspin (50 K MWCO) centrifugal devices. 
MsbA inhibitor G907 was added to the final concentration of 1 mM and the mix- 
ture was incubated on ice for 1h. Crystal trays were set up by mixing MsbA protein 
sample at 1:1 (v/v) ratio with mother liquor containing 200 mM NaCl, 20% PEG 
550 MME, 100mM HEPES, pH 7.0. Crystals were grown at 19°C in sitting drops 
for one week then collected and flash frozen in cryoprotectant containing crystal- 
lization buffer with increased PEG 500 MME concentration at 30%. 

Structure determination, refinement and analysis. Diffraction data were 
collected at 100 K using beamline 12-2 of the Stanford Synchrotron Radiation 
Lightsource and beamline 5.0.2 of the Advanced Light Source. All X-ray diffraction 
data were integrated and scaled with the HKL2000 suite® and further processed 
with the CCP4 package”. All attempts to determine the G907-LPS-EcMsbA struc- 
ture by molecular replacement using a wide range of available ABC transporter 
models (full and partial models) failed. Because wild-type EcMsbA contains two 
native cysteine residues (Cys88 and Cys315) per subunit, we screened and pro- 
duced a covalent mercury complex of EcMsbA using the fluorescence detection of 
heavy-atom labelling (FD-HAL) method?”. This Hg-EcMsbA complex was sub- 
sequently co-crystallized with G907 as described above. To obtain experimental 
phases, a SAD dataset was collected at ALS 5.0.2 near the mercury absorption 
edge (A= 1.005 A) from a mercury-containing complex of G907-LPS-EcMsbA 
that was crystallized in the P1 space group (Extended Data Table 1). This P1 space 
group was ultimately found to contain two full MsbA transporters within the asym- 
metric unit. Phenix AutoSol** was initially able to identify five of a possible eight 
Hg sites within the heavy atom substructure at ~8.3 A (anomalous signal 0.1158; 
SKEW = 0.023; CORR_RMS = 0.738; FOM = 0.285), and according to standard 
phasing and density modification protocols (FOM = 0.73), all 12 transmembrane 
helices of the transporter could be clearly resolved using data extending to ~3.7A 
resolution. The transmembrane skeleton of MsbA was manually built initially as 
a poly-Ala structure. For the NBDs, an available high-resolution model of the 
Haemophilus influenzae MsbA NBD (PDB accession 5EUM) was truncated to 
poly-Ala and placed manually into the observable NBD density. Reiterative rounds 
of molecular replacement with SAD phasing (MR-SAD) using PHENIX** were 
employed to extend the backbone structure (for example, periplasmic loops) 
and make observable rigid body adjustments within the NBDs. Relative to the 
transmembrane domains of EcMsbA, the structural asymmetry of the NBDs was 
apparent at very early stages of mercury-phased model building. A 2.9-A resolution 
native dataset from a G907-LPS-EcMsbA crystal in space group P2) was obtained, 
collected (A= 1.000 A) from SSRL 12-2, and found to contain one MsbA trans- 
porter (homodimer) within the asymmetric unit. Molecular replacement using 
an early model of G907-LPS—EcMsbA was performed using PHENIX software. 
Reiterative rounds of refinement and model building in Coot*® were guided by 
inspection of omit maps. Strict geometry and secondary structure restraints were 
applied throughout refinement to maintain stringent stereochemistry; however, 
non-crystallographic symmetry restraints were ultimately not applied because 
of the asymmetry within the NBDs. Although strong omit density was apparent 
at very early stages, coordinates for G907, LPS and phospholipids (assigned as 
phosphatidylethanolamine bound along the bulk membrane face of EcCMsbA) were 
added only at very late stages of refinement. A 2.92 A resolution native dataset 
from a G092-LPS—EcMsbA crystal in space group C2; was also obtained, collected 
(A= 1.000 A) from SSRL 12-2, and also found to contain one MsbA transporter 
(homodimer) within the asymmetric unit. As a starting point, all ligands from 
the refined G907-LPS—EcMsbA structure were removed and the B-factors of the 
remaining model were reset to 50 A? and then molecular replacement was per- 
formed using PHENIX software. Initially, real-space rigid-body adjustments of 
the NBDs and transmembrane helices were performed in Coot; then reiterative 
rounds of refinement with strict geometry and secondary structure restraints 
applied throughout were guided by inspection of omit maps in Coot*’. Coordinates 
for G092, LPS and phospholipids (assigned as phosphatidylethanolamine) were 
added only at very late stages of refinement. The geometry of the final G907- 
LPS-EcMsbA and G092-LPS-EcMsbaA structural models was assessed using 
MolProbity*® and structural alignments were performed using LSQMAN*". All 
structural figures were prepared with the PyMol software”. 

HRE-MS experiments. E. coli MsbA at 10mg ml! in FA-3 detergent was diluted 
to 0.9mg ml“! in 100mM NaCl, 0.06% FA-3, 20mM Tris pH 8.0. Three samples 
were prepared for FPOP analysis: inhibitor-free MsbA and MsbA incubated with 
1mM of either G907 or G092, after lyophilization of the appropriate amount of 
each inhibitor in the sample tube. HRF-MS analysis was performed using fast 
photochemical oxidation of proteins (FPOP) as previously described**. To avoid 
light exposure, MsbA solution and 30 mM H20) were rapidly mixed through 
a micro-tee mixer (Cobert Associates Laboratory) at flow rates of 15 jl min~! 
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and 7.511 min” ', respectively. The oxidation reaction was initiated by pulsing a 
focused 248 nm KrF excimer laser (GAM Laser) at 75 mJ per pulse at a rate of 
7 Hz through a 3.0 mm exposure window of 150 j1m internal diameter fused-silica 
tubing (Polymicro Technologies). Oxidized samples were collected in a tube con- 
taining a quench solution with methionine (Sigma Aldrich) and catalase (Sigma 
Aldrich) in a final concentration of 40 mM and 1 tM respectively. Each quenched 
sample was frozen before proteolysis and mass spectrometry. To account for back- 
ground oxidation of MsbA, an unexposed sample was obtained and compared to 
three oxidized replicates for each protein state. 

After FPOP, each protein sample was reduced and denatured at 45°C for 
10 min in reducing (10 mM DTT, 25 mM Tris, pH 8.0) and denaturation buffer 
(6M guanidinium hydrochloride, 1.0 mM EDTA, 250 mM Tris, pH 8.0). Reduced 
cysteines were alkylated with 25 mM sodium iodoacetate in denaturation buffer 
for 5 min at 45°C and quenched with 50 mM DTT at room temperature for 5 min. 
The samples were desalted before proteolysis over an Illustra NAP-5 column (GE 
Healthcare Bio-Sciences) equilibrated in 25 mM Tris, pH 8.0, and the eluate was 
digested with 5 1g of sequencing grade recombinant Trypsin (Roche) at 37°C 
for 1h. The digest was quenched with formic acid to a final percentage of 3.5 for 
liquid chromatography. 

Ten micrograms of tryptic peptides of MsbA was resolved on a Waters 
ACQUITY UPLC using a CSH C18 (130 A,1.7 jum, 2.1 x 150mm) column heated 
to 77°C. Peptides were eluted with a gradient of 99% buffer A (water, 0.1% formic 
acid) to 50% buffer B (acetonitrile, 0.1% formic acid) over 40 min at 0.3 ml min7!. 
To prevent carry-over between injections, the column was washed with 95% 
buffer B for 4 min and re-equilibrated to 99% buffer A for 20 min before the next 
injection. Peptide spectra were collected on a Thermo Fisher Q-Exactive hybrid 
quadrupole-orbitrap mass spectrometer. Data-dependent acquisition was used to 
collect tandem mass spectrometry (MS/MS) spectra of the most abundant peptides 
for the oxidized and unoxidized samples. Unmodified and oxidized species for 
each tryptic peptide were identified from the fragmentation spectra using Byonic 
(Protein Metrics). Percentage oxidation was calculated as previously described 
using the extracted mass of each identified species from the precursor scans 
(350-2000 m/z) with Byologic (Protein Metrics)**°, The tryptic peptides meas- 
ured are summarized in Extended Data Table 2. 

Molecular dynamics. To estimate how the MsbA dimer sits relative to the bacterial 
inner membrane, a complete model of MsbA derived from the refined G907- 
LPS—EcMsbA 2.9-A X-ray structure was placed in a simulated lipid and water 
environment. An all-atom model was generated from the X-ray crystal structure 
using the Protein Preparation function in Maestro***” which adds missing residues, 
side chains and hydrogens, predicts residue protonation states, and optimizes side- 
chain conformations. The G907 inhibitors, phospholipid, and LPS molecules were 
then removed. A simulation box of protein, lipids, water and ions was generated 
using the System Builder in Maestro’ using the following steps. The protein was 
first placed in a POPE lipid bilayer. The bilayer was initially aligned manually to the 
region where the protein surface is most hydrophobic. The system was neutralized 
with the addition of seven chlorine ions. An orthorhombic box was constructed 
with a 10-A buffer around the protein in all dimensions and the regions of the 
box not occupied by protein or lipid were filled with transferable intermolecular 
potential with three points (TIP3P) waters. The resulting system was then equili- 
brated using the relax_membrane.py”” and Desmond multisim**” programs. 
A final 10-ns NVT simulation was run using Desmond, with a temperature of 
300 K, a 2-ps time step, and with the protein backbone constrained (force constant 
of 5.0kcal mol~! A). The lipids from the final frame of the simulation were used 
and presented in Fig. 5a and Extended Data Fig. 8d-f. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. The data that support the findings of this study are available 
from the corresponding authors upon reasonable request. The crystallographic 
datasets for the G907-LPS-EcMsbA and G092-LPS-EcMsbA complexes that 
support the findings of this study have been deposited in the Worldwide Protein 
Data Bank repository (https://wwpdb.org/) under PDB ID accession codes 6BPL 
and 6BPP, respectively. 
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Extended Data Fig. 1 | Evaluation of the biochemical activity of 

MsbA and inhibitors. a, Quinoline inhibitors described in this study. 

b, Comparison of ATPase activity of increasing concentrations of E. coli 
MsbA in amphipol, nanodisc and FA-3 detergent matrices. c, d, Dose- 
response curves of compounds on purified Klebsiella pneumoniae (c) and 
Pseudomonas aeruginosa (d) MsbA reconstituted in amphipols. e, f, Dose- 
response curves of compounds on purified E. coli MsbA reconstituted 

in nanodiscs (e) or FA-3 detergent (f). g, h, Dose-response curves of 
compounds on purified E. coli A175V mutant MsbA reconstituted in 
amphipols (g) and nanodiscs (h). i, The LPS content of purified E. coli 
MsbA in the indicated matrices was measured using a chromogenic 


endotoxin quantitation assay (see Methods section ‘Quantitation of co- 
purifying LPS’). LPS from E. coli strain O111:B4 was used to generate a 
standard curve. Note that for both MsbA samples (0.0024 ng ml~ 1) the 
concentration of LPS (more than 0.02 ng ml~') exceeds the concentration 
of MsbA, indicating a molar excess of co-purifying LPS. Data are 

mean +s.e.m. from five independent experiments (b), three independent 
experiments (c-h) or two independent experiments (i). ICso values (b-h, 
in parentheses) were determined by fitting the inhibition dose-response 
curve with a nonlinear four-parameter inhibition model (see Methods 
section ‘MsbA ATPase assay’). 
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Extended Data Fig. 2 | Evaluation of phenotypic activity of MsbA 
inhibitors. a, ~a-MsbA western blot of solubilized extracts from an 

E. coli CFT073 UPEC imp strain overexpressing MsbA from a pBAD24 
vector in the presence of 2% arabinose (lane 1), an E. coli CFT073 imp 
strain (lane 2), an E. coli CFT073 imp strain with the arabinose-titratable 
msbA expression system in the presence of 2% arabinose (lane 3), an 

E. coli CFT073 imp strain with the arabinose-titratable msbA expression 
system in the presence of 0.01% arabinose (lane 4) and an E. coli CFT073 
imp strain with the arabinose-titratable msbA expression system in the 
presence of 0.002% arabinose (lane 5). Bacteria were grown for 3h at 
37°C before collection. The blot is representative of n =2 independent 
experiments. For gel source images, see Supplementary Fig. 1. b, Dose- 
response curves of compounds on E. coli CFT073 imp strains expressing 
endogenous (WT) and overexpressed (HIGH, in the presence of 0.2% 
arabinose) levels of MsbA. Data are mean + s.e.m. from two independent 
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experiments except for G247 (n =4). ICso values (in parentheses) 

were determined by fitting the inhibition dose-response curve with a 
nonlinear four-parameter inhibition model (see Methods section ‘MsbA 
ATPase assay’). c, Representative (20 isolated cells from two independent 
experiments) thin section electron micrographs of CFT073 lptD(imp4213) 
and CFT073 wild-type strains. Treatment of the [ptD(imp4213) strain with 
G592 (middle left) or G907 (middle right) and wild-type UPEC with G247 
(bottom) leads to membrane stacking (red arrow) and enlargement of 
cells, similar to cells depleted of MsbA (top right). Scale bars, 0.1 jum. 

d, e, Nucleotide and amino acid sequence changes (indicated in red) in two 
G592-resistant MG1655 AtolC clones (mut1 and mut2, d) and a G247- 
resistant wild-type E. coli clone (mut3, e, data not shown). For both panels, 
positions are relative to the start of msbA ORF. The frequency of resistance 
for G247 on wild-type E. coli was 3.9 x 10~° (data not shown). 
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Extended Data Fig. 3 | Comparison of the G907-LPS-EcMsbA and 
G092-LPS-EcMsbA complexes. a, b, Views of the crystal packing 
environments in the G907-LPS—EcMsbA and G092-LPS-EcMsbA 
complexes show distinct crystal lattice contacts. c, The superposition of 
G907-LPS-EcMsbA and G092-LPS-EcMsbA complexes indicates a high 
degree of structural similarity, reflected in a Ca r.m.s.d. of 0.8 A over 

all modelled residues. d, Chemical structure of G092. e, Dose-response 
curves of G092 on purified E. coli MsbA in FA-3 detergent or reconstituted 
into amphipols and on purified P. aeruginosa MsbA reconstituted into 
amphipols. Data are mean + s.e.m. from three independent experiments. 
ICs values (indicated) were determined by fitting the inhibition 
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blue: G907-LPS-EcMsbA 
pink: GO92-LPS-EcMsbA 


EcMsbA, C2) 


100 


dose-response curve with a nonlinear four-parameter inhibition model 
(see Methods). f, Overall 2.92-A crystal structure of the G092-LPS- 
EcMsbA complex shown from a side-view. LPS bound within the inner 
vestibule is shown in green sphere representation and G092 is shown in 
thin yellow stick representation. Fy — F, difference electron density maps 
(calculated before the addition of G092 into the model and refinement) 
are contoured at 20 (cyan) and 3o (pink). g, Fy — F, difference electron 
density maps of G092-LPS-EcMsbA (calculated before the addition of 
G092 into the model and refinement) are contoured at 20 (cyan) and 30 
(pink). The final refined coordinates of G092 are shown for reference. 
h, As in g, but a side-view of the G092 binding site. 
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Extended Data Fig. 4 | Molecular interactions of G907 in complex with 


@ polar oo" » sidechain acceptor 
O acidic + sidechain donor 
O basic *-® backbone acceptor 
© greasy ‘backbone donor 


£"" proximity contour 


EcMsbA. a, b, The MOE (Molecular Operating Environment; Chemical 
Computing Group ULC) software was used to highlight the molecular 
interactions observed between G907 and MsbA, with two similar 


© solvent residue arene-arene 
© metal complex OH arene-H 
ie solvent contact O+ arene-cation 
sae metal/ion contact 


illustrations of this representation provided. c, A legend describing the 
characteristics of the interacting groups as well as the types of interactions 
shown in a and b. 
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G907-LPS-EcMsbA 
subunit A 


Extended Data Fig. 5 | A conserved small molecule or peptide 
transmembrane binding pocket in the MsbA and Pgp B-family ABC 
transporters. a, Top, surface representation of the G907-LPS-EcMsbA 
crystal structure (subunit A, blue; subunit B, grey) with G907 in red 
sphere representation. LPS is omitted for clarity. Bottom, expanded 
view of the G907-binding site. b, Top, CePgp crystal structure in surface 
representation (N-terminal half, blue; C-terminal half, grey) with the 
extended N-terminal peptide (NTP) region of the transporter shown in 
red surface representation. Bottom, expanded view of the NTP-binding 
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G907-LPS-EcMsbA 
subunit B 


site. c, Similar to a, G907-LPS-EcMsbA crystal structure in surface 
representation (subunit A, blue; subunit B, grey) with G907 in red 

sphere representation, viewed approximately 180° relative to a to focus 

in on subunit B. LPS is omitted for clarity. Bottom, expanded view of 

the G907-binding site. d, Similar to a, CePgp crystal structure in surface 
representation (N-terminal half, blue; C-terminal half, grey) in which two 
modelled detergent molecules of n-undecyl-8-p-thiomaltopyranoside 
(UDTM) are shown in red sphere representation. Bottom, expanded view 
of the UDTM-binding site. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | Unique relative displacement of NBDs in both 
the G907-LPS-EcMsbA and G092-LPS-EcMsbA complex crystal 
structures. a—e, Structural superpositions of related B-family ABC 
transporters are provided in two orthogonal views, in which all structural 
superpositions were performed by aligning only the transmembrane 
portion of each subunit B protomer onto that of subunit A. In the case of 
CePgp, the C-terminal transmembrane region was superimposed onto 
the N-terminal transmembrane region. The structures are as follows: 

a, G907-LPS-EcMsbA; b, G092-LPS-EcMsbA; c, LPS-EcMsbA-cryo 
(PDB accession 5T V4); d, CePgp (PDB accession 4F4C); e, Thermus 
thermophilus TmrAB (PDB accession 5MKK). In comparing these 
structures, it is noteworthy that EcMsbA is a homodimeric transporter, 
whereas CePgp is a concatenated transporter, and TmrAB is a 
heterodimeric transporter; yet the structural similarity within all 


ARTICLE 


‘self-pairs’ of transmembrane regions is high. Specifically, when excluding 
the NBDs from this analysis: a, Ca r.m.s.d. 1.26 A; b, Ca r.m.s.d. 0.82 A; 

c, Ca rm.s.d. 1.09 A; d, Ca r.m.s.d. 3.18 A; e, Ca rm.s.d. 1.66 A. In 
contrast to the high structural correspondence observed between their 
superimposed or ‘self-paired’ transmembrane regions, the G907-LPS- 
EcMsbA and G092-LPS-EcMsbA complex structures both show a marked 
translational displacement of one NBD relative to the other, measuring 

up to 15 A between equivalent Cu positions. It is notable that the extent 
and apparent vector of the structural translation observed at the level of 
the NBDs are unique to the inhibitor-bound EcMsbA complexes studied 
here. For reference, the three residues that constitute the highly conserved 
CH1-CH2-NBD coupling interaction network are shown in yellow sphere 
representation in all structures. 
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Extended Data Fig. 7 | HRF-MS of LPS-EcMsbA in the presence and 
absence of G907 and G092. a, A schematic overview (with the target 
protein illustrated as a cartoon IgG shown in green) of the HRF-MS 
experiments performed in this study on LPS-EcMsbA in FA-3 detergent 
conditions in the presence or absence of G907 and G092. LC-MS/MS, 
liquid chromatography with tandem mass spectrometry; UPLC, ultra- 
performance liquid chromatography. b-c, Data are displayed as the 
differences measured in tryptic peptide oxidation of the ligand-bound 
condition with G907 present (b) or G092 present (c) relative to the 
inhibitor-free LPS-EcMsbA solution sample. The median residue number 
of the individual tryptic peptides (summarized in Extended Data Table 2 
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for the G907 experiment) is indicated on the x-axis. For both HRF-MS 
analyses, data are mean + s.e.m. from three replicate experiments. 

d, A cartoon rendering of the G907-LPS-EcMsbA crystal structure 
coloured according to the measured differences in tryptic peptide 
oxidation between the inhibitor-free and G907-bound condition presented 
in b. Regions without significant measured changes are coloured in dark 
grey and are found mainly within the transmembrane region; regions 
lacking robust peptide coverage are coloured in light grey. The phospho- 
N-acetylglucosamine moieties of the bound LPS molecule are shown only 
for reference, the remainder of LPS and G907 are omitted for clarity. 
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Extended Data Fig. 8 | Electron densities and electrostatic features 

of the LPS coordination site in the inner vestibule of the GO92-LPS- 
EcMsbA crystal structure. a, F, — F, difference electron density maps 
(calculated before the addition of LPS into the model and refinement) are 
contoured at 2.50 (purple) and 5o (pink). b, Fy — F, difference electron 
density maps (calculated with LPS omitted from the model) contoured 

at 30 (purple) and 5c (pink). c, Fy — F, difference electron density maps 
(calculated with LPS omitted from the model) contoured at 1o (cyan) and 
50 (pink). a—c, The final refined coordinates of LPS are also shown for 
reference and the arginine side chains are omitted for clarity. The observed 
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electron density positions the phospho-N-acetylglucosamine moieties of 
bound LPS within the conserved arginine coordination ring. d, e, Overall 
electrostatic surface representation of MsbA in the G092-LPS-EcMsbA 
complex with LPS shown in yellow stick representation and G092 omitted 
for clarity. For reference only, the position of the bulk membrane bilayer 
is depicted on the basis of a 10-ns molecular dynamics simulation 

(see Methods section ‘Molecular dynamics’). f, Similar to d, e, expanded 
view with the closest MsbA subunit removed for clarity and LPS shown 

in yellow stick representation. g, Similar to f, expanded view from the 
cytoplasmic perspective, where both MsbA subunits are displayed. 
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Extended Data Table 1 | Data collection, phasing and refinement statistics 


Data collection 


Native 
G907-LPS-EcMsbA 


Native 
G092-LPS-EcMsbA 


Hg-SAD° 
G907-LPS-EcMsbA 


Space group P24 C21 Pl 
Cell dimensions 

a, b,c (A) 74.7, 91.6, 111.0 256.3, 89.8, 72.5 71.4, 89.2, 109.7 

a By’) 90, 89.4, 90 90, 91.3, 90 90.3, 90.1, 89.9 

Hg-Peak 

Wavelength 1.000 1.000 1.005 
Resolution (A)? 40 — 2.90 (3.00-2.95; 2.95-2.90) 40 — 2.92 (3.02-2.97; 2.97-2.92) 22 — 3.73 (3.8 - 3.73) 
Rumerge 0.097 (0.50; 0.53) 0.098 (0.55; 0.57) 0.03 (0.42) 
I/o(I) 11.97 (1.09; 0.95) 10.81 (1.17; 1.1) 7.48 (1.31) 
CCin 0.995 (0.624; 0.576) 0.996 (0.553; 0.509) 0.997 (0.513) 
Completeness (%) 91.1 (70.4; 61.0) 92.3 (83.3; 78.0) 94.4 (90.4) 
Redundancy 4.0 (2.6; 2.4) 3.9 (2.7; 2.6) 1.7 (1.6) 
Refinement 
Resolution (A) 39.9 —2.91 38.2 — 2.92 
No. reflections 29,686 (1528) 32,865 (2137) 
Rwork / Riree® 23.33 / 28.29 25.8 fQITAT 
No. atoms? 

subunit A 4469 4389 

subunit B 4358 4124 

G907/G092 62 60 

LPS 233 225 

PE 58 53 
B factors*® 

subunit A 73.47 61.85 

subunit B 80.00 72.85 

G907 62.41 48.07 

LPS 121.93 116.58 

PE 93.34 80.62 
r.m.s deviations 

Bond lengths (A) 0.003 0.004 

Bond angles (°) 0.666 0.792 


Single crystal used for each dataset. 

*Values in parentheses are for highest-resolution shells. 

>Anomalous signal, 0.1158; SKEW, 0.023; CORR_RMS = 0.738; FOM after SAD phasing/density modification, 0.285/0.73. 

°5% of reflections used to calculate Ryree- 

dRepresents all non-hydrogen atoms modelled: native-G907 Ramachandran plot, 95.42% favoured, 4.58% allowed, 0% outlier; native-GO92 Ramachandran plot: 95.4% favoured, 4.6% allowed, 
0% outlier. 
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Extended Data Table 2 | List of tryptic peptides measured in Fig. 4e HRF-MS experiments 


Median peptide Tryptic 
residue, as peptide Tryptic peptide sequence 
indicated on residues: (median residue indicated in bold and underlined) 
Figure 4e start - end 
10 6-14 DLSTWOTFR 
20 16-24 LWPTIAPFK 
42 25-58 AGLIVAGVALILNAASDIFMLSLLKPLLDDGFGK 
70 62-78 SVLVWMPLVVIGLMILR 
87 79-95 GITSYVSSYCISWVSGK 
99 96-101 VVMIMR 
111 104-118 LFGHMMGMPVSFFDK 
123 119-127 QSTGTLLSR 
138 128-148 ITYDSEQVASSSSGALITVVR 
166 149-183 EGAS IIGLFIMMFYYSWQLSIILIVLAPIVSIAIR 
204 195-212 NMONTMGQVIETSAEQMLK 
222 216-228 EVLIFGGQEVETK 
272 244-299 MVSASSISDPIIQLIASLALAFVLYAASFPSVMDSLTAGT ITVVFSSMIALMRPLK 
305 300-310 SLTNVNAQFOR 
320 311-328 GMAACOTLFTILDSEQEK 
342 338-345 ATGDVEFR 
350 346-354 NVTFTYPGR 
358 355-360 DVPALR 
374 371-377 TVALVGR 
387 383-391 STIASLITR 
400 392-408 FYDIDEGEILMDGHDLR 
413 409-416 EYTLASLR 
429 417-441 NQVALVSQNVHLFNDTVANNIAYAR 
452 448-455 EQIEEAAR 
461 456-465 MAYAMDFINK 
476 466-486 MDNGLDTVIGENGVLLSGGOR 
496 494-497 ALLR 
508 498-517 DSPILILDEATSALDTESER 
523 518-528 AIQAALDELOK 
535 531-538 TSLVIAHR 
552 545-561 ADEIVVVEDGVIVER 
965 560-569 GTHNDLLEHR 
574 570-577 GVYAQLHK 
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Magnetic reconnection in current sheets is a magnetic-to-particle 
energy conversion process that is fundamental to many space and 
laboratory plasma systems. In the standard model of reconnection, 
this process occurs in a minuscule electron-scale diffusion region’. 
On larger scales, ions couple to the newly reconnected magnetic- 
field lines and are ejected away from the diffusion region in the 
form of bi-directional ion jets at the ion Alfvén speed*>. Much of 
the energy conversion occurs in spatially extended ion exhausts 
downstream of the diffusion region®. In turbulent plasmas, which 
contain a large number of small-scale current sheets, reconnection 
has long been suggested to have a major role in the dissipation 
of turbulent energy at kinetic scales’"!!. However, evidence for 
reconnection plasma jetting in small-scale turbulent plasmas has 
so far been lacking. Here we report observations made in Earth’s 
turbulent magnetosheath region (downstream of the bow shock) 
of an electron-scale current sheet in which diverging bi-directional 
super-ion-Alfvénic electron jets, parallel electric fields and enhanced 
magnetic-to-particle energy conversion were detected. Contrary to 
the standard model of reconnection, the thin reconnecting current 
sheet was not embedded in a wider ion-scale current layer and 
no ion jets were detected. Observations of this and other similar, 
but unidirectional, electron jet events without signatures of ion 
reconnection reveal a form of reconnection that can drive turbulent 
energy transfer and dissipation in electron-scale current sheets 
without ion coupling. 

The magnetosheath region, which is located downstream of Earth’s 
bow shock and consists of shocked solar wind, is highly turbulent. 
This region often contains hundreds of small-scale current sheets in 
which magnetic reconnection could potentially occur®!%!? (Fig. 1c). 
Many of these sheets are thin (ion inertial length scales or smaller), 
typically convecting past an observing spacecraft in a few seconds or 
less. If standard reconnection (Fig. 1a) were to operate in turbulent 
current sheets, then the ion jets in the extended exhausts should be 
the easiest reconnection signature to detect. Although electric-field 
and magnetic-field structures consistent with standard reconnection 
have been reported*"®, in situ plasma measurements of the jets were 
not obtained for these thin current sheets because the data resolution 
using instruments on previous spacecraft (typically a few seconds per 
velocity measurement) was not sufficient to determine their presence 
or absence. 

The four-spacecraft Magnetospheric Multiscale (MMS) mission, 
launched in 2015 and designed to reveal the kinetic physics of recon- 
nection in near-Earth space, is flying in an electron-skin-depth-scale 
(about 7-10 km) tetrahedral formation. It measures three-dimensional 
electron and ion distributions at up to 7.5-ms and 37.5-ms resolution, 


respectively, which are 400 times and 80 times better resolved than 
previously available data. MMS observations of current sheets in the 
turbulent magnetosheath have revealed thin current sheets!4, fast 
electron flows’*!° and electron heating’”!°. These characteristics are 
similar to those observed in the electron-diffusion region of standard 
reconnection in large-scale current sheets at the outer edge of the 
magnetosphere (the magnetopause)””” and in the laminar magne- 
tosheath (which originate in the solar wind) 18.19 However, ion jets, which 
should occur over a larger scale and therefore be more easily observed if 
standard reconnection is present in the current sheets in the turbulent 
magnetosheath, remain elusive. This raises the question of whether fast 
electron flows in these current sheets are produced by some process(es) 
besides reconnection. 

Here we report the serendipitous simultaneous multi-spacecraft 
detection of oppositely directed super-ion-Alfvénic electron jets, par- 
allel electric fields and magnetic-to-particle energy conversion in an 
electron-scale current sheet in the magnetosheath, providing direct 
evidence for reconnection without ion-scale coupling in turbulence. 

In Fig. 2a—c we show the large-scale context of the MMS observa- 
tions in the subsolar magnetosheath region on 2016 December 9 at 
8:58-9:43 UT, with large fluctuations in both the magnitude of the 
magnetic field (Fig. 2a) and its components (Fig. 2b). Figure 2d-g 
reveals these fluctuations to be sharp changes in the magnetic field asso- 
ciated with large spikes in current density j, many with |j| > 2A m~ 
and comparable to the peak current densities observed in the 
electron-diffusion region at Earth’s magnetopause”!”°. Such large 
current densities across magnetic-field variations of a few tens of 
nanotesla imply current-sheet widths of a few tens of kilometres or 
less, less than the ion inertial length d; of about 50km in this interval. 
Closer inspection of the variations in current density and magnetic 
field throughout the interval in Fig. 2 reveals a range of current-sheet 
thicknesses: many, but not all, are of sub-ion scales. 

To distinguish the regular fast electron flows associated with any thin 
current sheet from electron jets due to reconnection, data should be 
examined in a current-sheet (“LMN’) coordinate system: the current- 
sheet normal points along N, L is along the anti-parallel magnetic- 
field direction and M=N x Lis in the out-of-plane (‘X-line’) direction 
(Fig. 1a). In sucha frame, the main current is in the M direction, while 
the bi-directional reconnection outflows are in the +L directions 
(Fig. 1b). Definitive evidence for reconnection would be the simul- 
taneous detection of oppositely directed plasma outflow jets by two 
spacecraft located on opposite sides of the X-line*. 

Such an event was captured at around 09:03:54 ut, when |j| reached 
approximately 3 1A m7? (red arrow in Fig. 2g). In Fig. 3 we show this 
current sheet in detail, which had a magnetic shear of 14° (the guide 
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Fig. 1 | Schematics contrasting standard magnetic reconnection in 
large-scale current sheets and electron-only reconnection in small-scale 
turbulence. The reconnection configurations in a and b are displayed in 
the current-sheet (LMN) coordinate system. a, In standard reconnection, 
the magnetic topology changes in the small electron-diffusion region 
(EDR) around the X-line, but most of the magnetic-to-particle energy 
conversion happens in the extended exhausts, with bi-directional ion 
jetting and heating. The width of the electron-diffusion region (along 

N) is of the order of the electron skin depth d., whereas its length 

along + L could be up to an ion inertial length, d; = 43d,. The electron- 
diffusion region is embedded in an ion-diffusion region (IDR), whereas the 
magnetohydrodynamic-scale reconnection exhaust can extend thousands 
of d; (along L) away from the X-line?. b, Schematic of reconnection in an 
electron-scale current sheet involving only electrons, with no ion coupling. 
The entire current sheet is essentially the electron-diffusion region, having 

a single (electron) scale with embedded bi-directional super-ion-Alfvénic 
jets. MMS 1 and MMS 3 trajectories through the current sheet relative to the 
X-line are overlaid (dashed arrows), deduced on the basis of the directions of 
the electron jets observed on 2016 December 9 at around 09:03:54 ut (and 
shown in Fig. 3). MMS 1 and MMS 3 were on opposite sides of the X-line 
detecting bi-directional electron jets. The slanted spacecraft trajectories take 
into account the likely motion (in the spacecraft frame) of the X-line due 

to the presence of an external electron flow along + L of about 150kms~!. 

c, Schematic showing the formation of multiple small (d\-scale) magnetic 
structures and thin (d,-scale) current sheets at their interfaces in turbulent 
plasmas, informed by turbulence simulations®!”. 


field was By 40 nT, compared to an anti-parallel field of |B,| +5 nT). 
Four-spacecraft timing analysis finds the thickness of the current sheet 
to be only 4km (or 4 electron skin depths, d,), determined from the 
crossing duration of 45 ms (between the vertical dashed lines in Fig. 3) 
and the convection speed of Vy=95 kms! of the current sheet. 
Inside this electron-scale current sheet, both MMS 3 (left) and MMS 
1 (right) observed fast out-of-plane electron flows (Vey~900kms~}; 
Fig. 3c, m) that produced the main current jy (Fig. 3d, n) and the 
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associated reversal of B; (Fig. 3a, k). The speed V.y is comparable to the 
inflow electron-Alfvén speed of Vaer = 1,000 kms! based on B,~5nT 
and an electron number density of 20 per cubic centimetre. 

Coincident with the intense current layers, MMS 3 and MMS 1 
simultaneously observed oppositely directed electron jets in the 
outflow (L) direction, with A Ve, + +250kms7~! at MMS 3 (Fig. 3c) 
and AV. + —450kms~! at MMS 1 (Fig. 3m), relative to an external 
electron flow in the L direction of Vez ++ 150kms~!. The speeds of 
these electron outflow jets were roughly 10-18 times the asymptotic 
ion-Alfvén speed (based on Bz) of Vai 25kms_!. As expected for a 
reconnection geometry with inflow from both sides, the changes in By 
for MMS 1 are correlated with those in V., in the first part of the field 
change and anti-correlated in the second half, whereas for MMS 3 the 
reverse holds. An exception to this behaviour is that MMS 3, but not 
MMS 1, observed an electron jet with AV. ~ —300 kms! at the right 
edge of the current sheet that is opposite to the main AV, jet (Fig. 3c). 
Simulations of standard reconnection with a strong guide field have 
shown such AVer edge jet?! The lack of a reversed jet at the edge of the 
current sheet at MMS 1 is currently not understood. 

The measurements of oppositely directed electron outflow jets at 
MMS 1 and MMS 3 are further supported by the higher-resolution 
(0.125 ms) measurements of the L component of the field-line velocity 
E x B/B? (where E is the electric field and B is the magnetic field), 
which was negative at MMS 1 and positive at MMS 3 (except for a 
negative dip at the right edge, similar to that in V.;) (Fig. 3q, g). These 
(E x B/B’), outflows were predominantly perpendicular to the mag- 
netic field, owing to the large By (Fig. 3a, k); En (Fig. 3e, 0), which is 
opposite at the two spacecraft, together with the dominant By, drives 
the outflows. Crucially, MMS 3 was located 7.1 km in the+ L direction 
relative to MMS 1 so that the observations are consistent with diverging 
jets from a reconnection X-line located between the two spacecraft as 
they pass through the reconnecting current sheet. There was no evi- 
dence for ion jets (AV;,) at the ion-Alfvén speed (Fig. 3b, 1) within the 
thin current sheet. That ion jets are absent is not surprising, because the 
thickness of the current sheet was only 0.094; (or 0.08 ion gyroradius) 
and the observations were made within 7d, of the X-line. 

What is surprising, however, is that the electron-scale reconnecting 
current sheet was not embedded inside a much larger ion-scale 
current sheet, as would be expected (and observed) in standard recon- 
nection!8-20.22 (Fig. 1a). The absence of an outer ion-scale current 
sheet can be seen in Fig. 3a, k (see also Extended Data Fig. 1), which 
shows B, reaching its asymptotic values immediately outside the thin 
current sheet. 

Both spacecraft detected well defined parallel electric fields 
(Fig. 3f, p), which implies that the ‘electron frozen-in conditio 
(E'=E+ V. x B=0) was violated. Furthermore, j - E’ was positive 
(Fig. 3), t) and dominated by jE), indicating non-ideal magnetic-to- 
particle energy conversion”? characteristic of the electron-diffusion 
region. However, unlike for standard reconnection, where most of the 
magnetic energy is converted into ion jetting and heating, here, half of 
the available magnetic energy per particle (half of 6 eV) in the inflow 
regions (me(Vaer)*, where me is the electron mass) goes into kinetic 
energy associated with AV.y and AV.1, which are 90% and 45% of 
Vaer, respectively. The remaining half (3 eV), if converted entirely into 
electron heating, would lead to an increase in electron temperature of 
(3eV) x (y-1)/yx 1eV in the reconnecting current sheet”4, where 
-y=5/3 is the ratio of specific heats. Such a small temperature increase 
would not be discernible in the data (Fig. 3i, s). 

Within the 21-min burst data intervals shown in Fig. 2a, there were 
34 other isolated current sheets with |j| >2 A m~?, which implies 
sub-ion-scale current-sheet widths. Surprisingly, the majority of these 
current sheets had low magnetic shears (that is, strong guide fields): 
23 of the 34 events had magnetic shear of less than 45°. All 34 sheets 
had a fast out-of-plane electron velocity V.. consistent with the large 
current density j,4, but only 16 displayed clear super-ion-Alfvénic Ver 
jets that could be related to reconnection. In each of these cases, all 
four spacecraft detected V., pointing in the same direction and were 
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Fig. 2 | Overview of MMS 1 observations of turbulent current sheets in 
Earth’s subsolar magnetosheath region downstream of a quasi-parallel 
shock. The observations reveal the presence of large spikes in current 
density (more than 2;1A m~*), which imply sub-ion-scale current sheets. 
The data are displayed in the geocentric solar ecliptic (GSE) coordinates. 
a, b, The magnitude (a; |B|) and components (b; B,, By and B-) of the 
magnetic field. c, The ion energy-time spectrogram of differential energy 
flux (colour scale, in units of eV s~! cm~? sr~! eV~!). d-g, Zoomed-in 
(4-min) interval showing the magnetic field (d; B,, B, and B.), ion velocity 
(e; Vix, Viy and = electron velocity (f; Vex, Vey and V..) and current 
density (g; |j| =j = eN-(Vi— Ve), where e is the elementary charge and Ne 


therefore embedded in the same jet. The scarcity of unambiguous 
reconnection events with divergent jets (which implies that the X-line 
is located between the spacecraft) is probably due to the small (about 
7 km or 7d.) separations between the spacecraft. 

We found no evidence for reconnection ion jetting associated with 
any of the electron outflow jet events, or in any other (including ion- 
scale-thick) current sheets in the 21-min interval (see examples in 
Extended Data Fig. 2). This finding is in stark contrast to standard 
models of reconnection in which the ion exhaust jets should be easier 
to detect than the electron-diffusion region because they extend large 
distances from the X-line. 

The absence of ion reconnection signatures suggests that, in these 
turbulent magnetosheath plasmas, there is insufficient space and/ 
or time for the ions to couple to the magnetic structures. This could 
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is the electron number density) computed from plasma measurements. 
Throughout the interval in a-c, the angle between the magnetic field of 
the solar wind and the Sun-Earth line was less than 30° and the subsolar 
bow shock was quasi-parallel. The spike in current density at around 
09:03:54 ur (indicated by the red arrow in g) is the bi-directional electron 
jet event, shown in detail in Fig. 3. Red horizontal bars in a denote burst 
data intervals totalling 21 min, selected by the MMS scientist on duty 
because they contained a large number of high-amplitude magnetic-field 
fluctuations (with AB/B ~ 0.5), suggestive of current sheets that could be 
prone to reconnection. 


occur not only because the widths of the current sheets are of elec- 
tron scales, but also if the overall dimensions of the current sheets are 
limited because ion coupling requires some minimum lengths along 
the exhaust” (L) and X-line”® (M) directions. A hybrid simulation 
study of resistive reconnection with no guide field”* suggests that ions 
become decoupled when the length of the current sheet in the L direc- 
tion (Fig. 1c) falls below about 10d;. If such an ion decoupling scale 
also exists in strong-guide-field collisionless reconnection, although 
potentially at a smaller scale than 10d;, then it could account for our 
observed lack of ion coupling in reconnection in the magnetosheath, in 
which the coherence scales of magnetic structures have been reported”” 
to be of the order of dj. 

The experimental discovery of electron-only reconnection reveals 
that reconnection operates differently in current sheets with small 
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Fig. 3 | Simultaneous MMS 1 and MMS 3 detections of oppositely 
directed super-ion-Alfvénic electron jets, parallel electric fields and 
enhanced magnetic-to-electron energy conversion in an electron-scale 
current sheet. The data for both spacecraft (MMS 3, a-j; MMS 1, k-t) are 
shown in a common current-sheet (LMN) coordinate system, determined 
for the MMS 3 crossing of the current sheet at 09:03:54.270-09:03:54.365 UT, 
with L = (—0.091, 0.87, 0.49) cep, M= (—0.25, —0.49, 0.83) sp and 

N= (0.96, —0.05, 0.27)csz. The vertical dashed lines mark the left and 
right edges of the current sheet. a, k, Magnetic field Bryn at 8,196 
samples per second (from merged fluxgate and search-coil magnetometer 
measurements**), with By shifted by —30nT. b, c, 1, m, Ion (b, |; Vit) 
and electron (c, m; Vern) velocity. The 7.5-ms electron and 37.5-ms ion 
data products were generated by separating the individual energy sweeps 
that were used to form the nominal burst-mode distribution functions. 
These data maintain sufficient angular coverage to recover accurate 


overall dimensions. Our finding supports the long-held idea that 
reconnection has a role in dissipating the energy associated with 
plasma turbulence in space and astrophysical systems, although the 
scale for dissipation by reconnection would be at the electron scale 
instead of the ion scale. To assess the importance of reconnection 
in dissipating turbulence energy in small systems quantitatively, 
the basic properties of electron-only reconnection (such as the 
rate, duration and onset conditions of reconnection) will need to 
be investigated theoretically and observationally. These properties 
could differ substantially from those known from the standard model 
of reconnection. 


plasma moments at four times the nominal temporal resolution!°. 

d, n, Current density (j:mn) from plasma measurements. e, 0, Electric 
field”* in the spacecraft frame (Eqn) at 8,196 samples per second. 

f, p, Electric-field component parallel to the magnetic field (E))). 

g, q (Ex B/B’)imn velocity. h, Electron density (Ne). i, Electron 
temperature (T.). j, t, j- (E+ V. x B) =j- E’. The LMN coordinate 
system was determined using a hybrid minimum-variance method, 
which often works best in low-magnetic-shear current sheets”’. The 
current-sheet normal direction N was determined from B, x B2/|B, x Ba|, 
where B, and B; are the fields at the two edges of the current sheet. We 
define M=L' x N, where L’ is the direction of maximum variance of 
the magnetic field*°; L= N x M. MMS 3 was located at L= +7.1km, 
M=+3.3km and N=+1.6km relative to MMS 1. Data from all four 
spacecraft are shown in Extended Data Fig. 3. 


Data availability 
The entire MMS dataset is publicly available at https://lasp.colorado.edu/mms/ 
sdc/public/. 
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Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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Extended Data Fig. 1 | Large-scale context of the thin current sheet, 
illustrating the fact that the electron-scale current sheet was a stand- 
alone current sheet, not embedded inside an ion-scale one. Data are 
shown in the LMN coordinates determined for the thin current sheet and 
used in Fig. 3. a, Magnetic field. b, Ion velocity. c, Electron velocity. 

d,j- (E+ V. x B) =j- E’. The thin reconnecting current sheet stands out in 
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this interval, with nothing else approaching its current density or its value 
of j- (E+ V. x B). The absence of an ion-scale current sheet enveloping 
the electron-scale current sheet is indicated by the fact that |B,| reaches 
essentially its asymptotic values immediately outside the thin current 


sheet. 
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Extended Data Fig. 2 | Absence of reconnection ion jetting. The data 
are in GSE coordinates. a, Magnetic field. b, Ion velocity. c, d, y (c) and 
z(d) components of the ion velocity (V;) and Alfvén velocity (Va). Va is 
relative to the reference velocity, density and magnetic-field values at the 
left edge of the data interval: Vy = Brer(1—Orer) (to Preg) /?—B(1—a) 
(op) “?, where io is the vacuum permeability, a = (pj—p )/40/B? is the 
pressure anisotropy factor and p is the mass density of the plasma’. The 
expected speeds of the ion reconnection jets embedded inside many of the 
large-magnetic-shear current sheets are in the range of 100-200 kms“, 


based on B, variations of around 20—40ntT (a). If present, such jets are 
readily recognized by back-to-back opposite correlations between ion 
velocity and magnetic-field variation at the two edges of the current sheet, 
which indicate pairs of rotational discontinuities emanating from the 
X-line®. These signatures are not seen here. What we instead find in the 
data is either no correlation between components of V; and B, or a single 
correlation (or anti-correlation), indicative of Alfvénic structures!® rather 
than reconnection jetting. 
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Extended Data Fig. 3 | Four-spacecraft observations of the reconnecting current sheet, which is the 4.5-km separation distance between the two 
current sheet. A common current-sheet LMN coordinate system (same spacecraft along N (j). This is consistent with the current-sheet width of 
as in Fig. 3) was used for consistency, justified by the fact that the LMN 4km determined from the motion and crossing duration of the current 
coordinates at individual spacecraft differ from each other by less sheet. Inside the current sheet, MMS 4 detected a positive (E x B/B*), 
than 4°. a-d, L (a, c) and M (b, d) components of the magnetic field (except at the right edge), similarly to MMS 3, whereas MMS 2 detected a 
(a, b) and of the electron velocity (c, d). e, M component of the current negative (E x B/B’),, similarly to MMS 1. This indicates that there was a 
density. f, L component of the E x B/B? velocity. g, N component of the pair of spacecraft on each side of the X-line. All four spacecraft detected a 
electric field. h, Electric-field component parallel to the magnetic field. predominantly negative E). The parameter j - (E+ V. x B) was consistently 
i,j- (E+ V. x B) =j- E’.j, Spacecraft locations relative to MMS 1, in positive at all four spacecraft throughout the current sheet, with the 
kilometres (roughly d,). The By profiles (a) show that MMS 1 and MMS 3 amplitude being lowest at MMS 2. MMS 2 also detected the largest guide- 
crossed the current sheet at essentially the same time, preceded by MMS field (By) compression, fastest A V.z and (E x B/B’), jets, slowest AVem 
4 and followed by MMS 2. The fact that MMS 4 exited the current sheet and weakest E), which together may suggest that MMS 2 was furthest away 
before MMS 2 entered it places an upper limit on the thickness of the from the X-line. 
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Precision measurement of the weak charge of 


the proton 


The Jefferson Lab Qweak Collaboration* 


Large experimental programmes in the fields of nuclear and particle 
physics search for evidence of physics beyond that explained by 
current theories. The observation of the Higgs boson completed the 
set of particles predicted by the standard model, which currently 
provides the best description of fundamental particles and forces. 
However, this theory’s limitations include a failure to predict 
fundamental parameters, such as the mass of the Higgs boson, and 
the inability to account for dark matter and energy, gravity, and 
the matter-antimatter asymmetry in the Universe, among other 
phenomena. These limitations have inspired searches for physics 
beyond the standard model in the post-Higgs era through the direct 
production of additional particles at high-energy accelerators, 
which have so far been unsuccessful. Examples include searches 
for supersymmetric particles, which connect bosons (integer- 
spin particles) with fermions (half-integer-spin particles), and 
for leptoquarks, which mix the fundamental quarks with leptons. 
Alternatively, indirect searches using precise measurements of 
well predicted standard-model observables allow highly targeted 
alternative tests for physics beyond the standard model because they 
can reach mass and energy scales beyond those directly accessible 
by today’s high-energy accelerators. Such an indirect search aims to 
determine the weak charge of the proton, which defines the strength 
of the proton’s interaction with other particles via the well known 
neutral electroweak force. Because parity symmetry (invariance 
under the spatial inversion (x, y, z) + (—x, —y, —z)) is violated 
only in the weak interaction, it provides a tool with which to isolate 
the weak interaction and thus to measure the proton’s weak charge’. 
Here we report the value 0.0719 + 0.0045, where the uncertainty is 
one standard deviation, derived from our measured parity-violating 
asymmetry in the scattering of polarized electrons on protons, which 
is —226.5 +9.3 parts per billion (the uncertainty is one standard 
deviation). Our value for the proton’s weak charge is in excellent 
agreement with the standard model’ and sets multi-teraelectronvolt- 
scale constraints on any semi-leptonic parity-violating physics not 
described within the standard model. Our results show that precision 
parity-violating measurements enable searches for physics beyond 
the standard model that can compete with direct searches at high- 
energy accelerators and, together with astronomical observations, 
can provide fertile approaches to probing higher mass scales. 

In the electroweak standard model, elastic scattering is mediated by 
the exchange of neutral currents (virtual photons and Z° bosons) 
between fundamental particles. A particle's weak charge, Qy, is analo- 
gous to—but distinct from—its electric charge, q; the former quantifies 
the vector coupling of the Z° boson to the particle, whereas the latter 
quantifies the vector coupling of the photon to the particle. The proton’s 
weak charge Q? is defined! as the sum of the weak vector coupling 
constants Cj, of the Z° boson to its constituent up (u) and down (d) 
quarks: 


Qh = —2(2Ciy+ Cia) (1) 


The Z° exchange contribution to electron-proton scattering can be 
isolated via the weak interaction’s unique parity-violation signature 


(see Fig. 1). Interference between electromagnetic and weak scatter- 
ing amplitudes leads to a parity-violation asymmetry Ag, that can be 
measured with a longitudinally polarized electron beam incident on 
an unpolarized-proton target: 

O,-O_ 


Asap 
ep o,+o_ (2) 


Here, o1 represents the cross-section of the helicity-dependent elastic 
scattering (ep) of polarized electrons (e) on protons (p), integrated over 
the scattered-electron detector acceptance. Helicity (+1) indicates the 
spin direction of the electrons in the beam as either parallel (+1) or 
anti-parallel (—1) to their momenta. 

Measuring Ag, at a small four-momentum transfer ( Q’) suppresses 
contributions from the proton’s extended structure relative to the pro- 
ton’s weak charge QP. However, Aep is small at small Q? values, making 
its measurement challenging. In the low-Q? limit, the parity-violation 
asymmetry can be expressed! as: 


Agp/Ag= QE + Q°B(Q’, A) (3) 


where Ay = —G,Q? /(4na-/2), —Q? is the four-momentum transfer 
squared, B(Q?, 0) represents the proton’s extended structure defined in 
terms of electromagnetic, strange and axial form factors, 6 is the (polar) 
scattering angle of the electron in the laboratory frame with respect to 
the beam axis, Gp is the Fermi constant and aq is the fine-structure 
constant. 

The Qweak experiment? (see Extended Data Fig. 1) used a beam 
of longitudinally polarized electrons accelerated to 1.16 GeV at the 
Thomas Jefferson National Accelerator Facility. Three sequential 
acceptance-defining lead collimators, matched to an eight-sector 
azimuthally symmetric toroidal magnet, selected electrons scattered 
from a liquid-hydrogen target at angles between 5.8° and 11.6°. In 
each magnet octant, elastically scattered electrons were directed toa 
quartz detector fronted by lead pre-radiators. Cherenkov light pro- 
duced by the electromagnetic shower passing through the quartz was 
converted to a current by photomultiplier tubes at each end of the 
quartz bars. These currents were integrated for each 1-ms-long heli- 
city state of the beam. For calibration purposes, and to confirm our 
understanding of the acceptance and backgrounds, drift chambers 
were periodically inserted upstream and downstream of the magnet 
to track individual particles during dedicated periods of low-current 
running. 

To achieve a precision of less than 10 parts per billion (p.p.b.), 
this experiment pushed existing boundaries on many fronts: higher 
polarized-beam intensity (180 j1A), faster beam-helicity reversal 
(960s), better GeV-scale beam polarimetry° precision (-+0.6%), 
higher liquid-hydrogen target® luminosity (1.7 x 10°°cm~? s~') 
and cooling power (3 kW) and higher total detection rates (7 GHz). 
Following a brief commissioning period, the experiment was divided 
into two roughly six-month run periods, between which improvements 
were made primarily to polarimetry and helicity-related beam- 
monitoring and control instrumentation. Further details, including the 


*A list of participants and their affiliations appears at the end of this Letter. 
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Fig. 1 | Parity-violating electron scattering from the proton. An 
incoming electron, e, with helicity +1 scatters away from the plane of 
the ‘parity-violating mirror’. The image in the parity-violating mirror 
shows the incoming electron with the opposite helicity, —1; instead of 
scattering into the plane of the parity-violating mirror (as it would in a 
real mirror), it scatters out of the plane of the parity-violating mirror. 
The dominant electromagnetic interaction, mediated by the photon 

(7; blue wavy line), conserves parity. The weak interaction, mediated 

by the neutral Z° boson (dashed red line), violates parity. The weak 
interaction is studied experimentally by exploiting parity violation through 
reversals of the incident-beam helicity, which mimic the parity-violating 
mirror ‘reflection. 


backgrounds and corrections associated with each of the two halves of 
the experiment, are provided in Methods. 

The asymmetry measurement results are A.p = —223.5 + 15.0 
(statistical) + 10.1 (systematic) p.p.b. in the first half of the experi- 
ment, and Aep = —227.2 + 8.3 (statistical) + 5.6 (systematic) p.p.b. in 
the second half. These values are in excellent agreement with each 
other and consistent with our previously published commissioning 
result’, Accounting for correlations in some systematic uncertainties 
between the two measurement periods, the combined result is 
Aep = —226.5 + 7.3 (statistical) + 5.8 (systematic) p.p.b. The total 
uncertainty achieved (9.3 p.p.b.) sets a new level of precision for 
parity-violating electron scattering (PVES) from a nucleus. 

The relationship between the measured asymmetries Aep and the 
proton’s weak charge QP is expressed by equation (3), where the 
hadronic-structure-dependent term B(Q’, 0) grows with the momen- 
tum transfer Q?. Higher-Q? data from previous PVES experiments (see 
online references, Methods) were included in a global fit®”* to con- 
strain the proton-structure contributions for the short extrapolation 
from our datum to Q?=0 in order to determine Q?, the intercept of 
equation (3). The average Q? of this experiment (0.0248 GeV*c~”) is 
much smaller than that of any other PVES experiments used in this fit, 
with correspondingly smaller contributions from the proton structure. 
The superior precision of the Qweak measurement tightly constrains the 
fit near Q? =0, where the connection to Q? can be made. 

The parameters of the global fit*”* to the PVES data are the 
axial-electron—vector-quark weak-coupling constants Cj, and Cjq, the 
strange charge radius p, and strange magnetic moment j1; (which char- 
acterize the strength of the proton’s electric and magnetic strange-quark 
form factors) and the strength of the neutral weak (Z° exchange) isovector 
(T= 1) axial form factor G : (T=) The EM form factors Gz and Gy used 
in the fit were taken from ref. ?; uncertainties in this input were 
accounted for in the result for QF and in its uncertainty. 

The ep asymmetries shown in Fig. 2 were corrected!” for the energy- 
dependent part of the yZ-box weak radiative correction! and its 
uncertainty. No other electroweak radiative corrections need to be 
applied to determine Q?. However, ordinary electromagnetic radiative 
corrections (bremsstrahlung) were accounted for in the asymmetries 
used in the fit, including our datum. Details of the fitting procedure, as 
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Fig. 2 | The reduced asymmetry A.,/Ay =Q? + Q’7B(Q?, = 0) versus Q’. 
The global fit is illustrated using ep asymmetries from this experiment 
(Qweak 2018), from the commissioning phase of this experiment? (Qweak 
2013), as well as from the earlier experiments HAPPEX, SAMPLE, PVA4 
and GO (see Methods), projected to @=0° and reduced by a factor Ao(Q’) 
appropriate for each datum. The data shown here include the yZ-box 
radiative correction and uncertainty. Inner error bars correspond to one 
standard deviation (s.d.) and include statistical and systematic uncertainties. 
Outer error bars on the data indicate the additional uncertainty estimated 
from the forward-angle projection (for some data points, inner and outer 
error bars coincide). The solid line represents the global fit to the complete 
PVES database (see Methods), and the yellow band indicates the fit 
uncertainty (1 s.d.). The arrowhead at Q’=0 indicates the standard-model 
prediction’, Qr = 0.0708(3), which agrees well with the intercept of the fit 
(Q? = 0.0719 + 0.0045). The inset shows a magnification of the region 
around this experiment’s result, at (Q’) = 0.0248 GeV c ”. 


well as a description of the corrections applied to the asymmetry for 
this experiment, are described in Methods. 

The global fit is shown in Fig. 2 together with the ep data, expressed 
as AglQ, 6 =0)/Apo. To isolate the Q? dependence for this figure, 
the 0 dimension was projected to 0° by subtracting [Acaic(Q”, 0) — 
Agaic(Q?, 6 =0)] from the measured asymmetries Bal. 0), as 
described in refs **. Here Acai. refers to the asymmetries determined 
from the global fit. The fit includes all relevant PVES data for the 
scattering of polarized electrons on protons (ep), deuterons (e’H) and 
“He (e*He); see Methods. The PVES database provides a data-driven 
(as opposed to a more theoretical) constraint on the nucleon structure 
uncertainties in the extrapolation to Q? =0. We consider this to be 
the best method to provide our main result (denoted in Table 1 as 


Table 1 | Results extracted from the asymmetry measured in the 
Qweak experiment 


Method Quantity Value Error 

PVES fit Qe 0.0719 0.0045 
Ps 0.20 0.11 
Ls —0.19 0.14 
GZ-)) —0.64 0.30 

PVES fit+ APV Fi 0.0718 0.0044 

ie —0.9808 0.0063 

Ciu —0.1874 0.0022 
Cig 0.3389 0.0025 
Ci correlation —0.9318 

PVES fit+ LQCD Qn 0.0685 0.0038 

Qweak datum only QP 0.0706 0.0047 

Standard model Qn 0.0708 0.0003 


‘PVES fit’ refers to a global fit incorporating the Qwea, result and the PVES database, as described 
in Methods. When combined with APV!415 (to improve the Cig precision), this method is denoted 
as ‘PVES fit+ APV’. If the strange form factors in the global fit (without APV) are constrained to 
match LQCD calculations!®, we label the result as ‘PVES fit+ LQCD’. The method labelled ‘Qweak 
datum only’ uses the Qweax datum, together with electromagnetic, strange!® and axial!8 form 
factors from the literature in lieu of the global fit. Uncertainties are 1s.d. 
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Fig. 3 | Variation of sin?6w with energy scale Q. The modified-minimal- 
subtraction (MS) scheme is shown as the solid curve*'’, together with 
experimental determinations at the Z° pole? (Tevatron, LEP1, SLC, LHC), 
from APV on caesium!*"», Moller scattering (E158)”’, deep inelastic 
scattering of polarized electrons on deuterons (e*H; PVDIS)” and from 
neutrino-nucleus scattering (NuTeV)~*. It has been argued”, however, 
that the latter result contains substantial unaccounted-for nuclear physics 
effects, such as neutron-excess corrections to the quark momenta, charge- 
symmetry breaking and strange-quark momentum asymmetries. Our new 
result is plotted in red at the energy scale of the Qweak experiment, 
Q=0.158 GeV (slightly offset horizontally for clarity). Error bars (1 s.d.) 
include statistical and systematic uncertainties. 


‘PVES fit’), which is Q? = 0.0719 + 0.0045. Below we discuss the sensi- 
tivity of this result to variations in the experimental and theoretical 
input used to determine it. 

Just as the proton’s weak charge depends on its u and d quark content 
(see equation (1)), the weak charge of other nuclear systems depends 
on their (different) u and d quark content. Because ep, e7H and e*He 
data are included in the global fit, Cj, and C)4 are reasonably well deter- 
mined. However, if the very precise atomic-parity violation (APV) 
result!*!5 on Cs is also included in the global fit, C}, and Cjq can be 
determined with greater precision and then used to extract the neu- 
tron’s weak charge Qi = —2(C,,,+2C,,). We note that inclusion or 
exclusion of the APV result has negligible impact on our result for QP, 
which is derived from the intercept of the global fit. The results for Cy, 
Cia QF and Q* obtained by including APV in the PVES global fit, 
which are listed in Table 1 as ‘PVES fit + APV, are in agreement with 
the standard-model values’. 

While our preferred result is based on the data-driven analysis of 
PVES fit, the final determination of the weak charge of the proton 
does not change appreciably with additional theoretical constraints. 
One of the dominant uncertainties in the term B(Q?, 0) of equation 
(3) arises from the knowledge of the strange-quark contributions. 
These have been determined very precisely in recent theoretical 
calculations'®!” employing lattice quantum chromodynamics 
(LQCD). Using these theoretical results to constrain the extrapolation 
to Q?=0 results in a slightly lower weak charge and a reduction in 
the uncertainty, as shown in Table 1 (“PVES fit + LQCD’). The APV 
result was not included in this determination of Q!; its inclusion 
makes negligible difference. 

Because the proximity to threshold (Q*— 0) and precision of our 
Qweak result overwhelmingly dominate the fits described above, it is 
possible to go one step further and use the Qweax datum by itself to 
determine Q?. The fact that the strange and axial form factors contri- 
bute so little at the kinematics of the Qweak experiment (0.1% and 2.5%, 
respectively) also helps motivate this consistency check. Using the same 
electromagnetic form factors? as in the fits above, the same lattice 
calculation’* for the strange form factors, and following the extraction 
method of ref. '® for the axial form factor, the QP result obtained by 
using just the Qweak datum falls in-between the consistent results of the 
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Fig. 4 | Mass and coupling constraints on new physics. a, Constraints, 

at the 95% confidence level, on the axial-electron—vector-quark weak- 
coupling constants C), and Cig, derived from the weak charge determined 
in this experiment using the global fit method ‘PVES fit’ (blue band) and 
the APV result”!*'* on '°3Cs (gold band). The combined (95% confidence 
level) constraint is shown by the black ellipse. Contours of the mass reach 
Alg for new physics with coupling g to arbitrary quark-flavour ratios are 
indicated by dashed circles centred about the standard-model values” 

of Cy and C4, which are denoted by the red square. b, Mass reach A/g 
(95% confidence level) as a function of the quark-flavour mixing angle 

O, for the Qweak ‘PVES fit’ result (blue curve), for the °3Cs APV!4}5 
result” (gold curve) and for both results combined (black curve). The two 
maxima in the blue curve at 6, = tan” !(ng/n,) =tan7!(1/2) = 26.6° and 
206.6° correspond to A_/g=8.4 TeV and A;/g=7.4 TeV in equation (4), 
respectively. 


other determinations described above, which employ the entire PVES 
database (see Table 1, “Qweak datum only’). The uncertainty of the Q? 
result in this case includes an additional uncertainty (4.6 p.p.b.) due to 
the calculated form factors, but is only 4% larger than the uncertainty 
of the global fit result, which uses the entire PVES database. The dom- 
inant correction, from the electromagnetic form factors (23.7%), is well 
known in the low-Q? regime of the Qweak experiment. 

The QF? determinations described above can be used to test the 
prediction of the standard model for sin?6, the fundamental 
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electroweak parameter characterizing the mixing of the electromag- 
netic and weak interactions in the standard model (Ow is the Weinberg 
angle). By neglecting small quantum corrections, the standard model 
derives!*° sin?@w in terms of the electroweak boson masses, Mw 
and My, as sin’O,w = 1 — (Mw/M,)° © 1/4, and so QP = 1 — 4sin’Ow 
is nearly zero. This ‘accidental’ standard-model suppression of Q? 
makes it an ideal observable in the search for new parity-violation inter- 
actions of natural size”!. Using the latest input” to calculate quantum 
corrections that relate! sin?@w(Q=0) to QP, as described in Methods, 
we obtain sin’Oy(Q = 0)yg = 0.23830.0011 in the modified- 
minimal-subtraction (MS) scheme”!?. 

Figure 3 shows our sin?@ result (after subtracting 0.00012 to plot it 
at the Qweak energy scale, 0.158 GeV), along with other sin? deter- 
minations””°?-*>, Our value is consistent with the standard-model 
expectation and the purely leptonic E158 result” obtained in Moller 
(ee) scattering, which has different sensitivities to new physics from our 
semi-leptonic (ep) result. Although the measurements at the Z° pole 
are more precise than our result, there exist a variety of beyond-standard- 
model (BSM) scenarios that can have considerable influence on 
low-energy precision measurements but little effect on collider meas- 
urements at the Z°-mass energy scale*®. An example is the dark-photon 
model of ref. 7”, which allows large effects for dark Z mediators of a 
few hundred megaelectronvolts at low Q, but no effects at the Z” pole. 

To explore this experiment’s sensitivity to new BSM contact interac- 
tions, we follow the convention”® where a ‘new physics’ term, gl A?, is 
added to the standard-model term git, / (2v*) used in the Lagrangian 
to describe the neutral-current interaction of axial-vector electrons 
with vector quarks. (This convention differs from an earlier one! by a 
factor of 4; a factor of 2 in A). Here re =, q= 2g 84 is the standard- 


model axial-electron-vector-quark coupling, v? = /2 /(2G,), and A 
represents the mass reach for new physics (the mass of the hypothetical 
BSM particle being exchanged) with coupling g. Expressed in terms of 
Q? and its uncertainty, + AQP, the 95%-confidence-level mass reach is 


Ay 4/5 
g : Pt 1.96AQ? — Q?(SM)| 


where A_./g=7.4TeV and A_/g=8.4 TeV. 

For the extreme contact-interaction coupling” g” =4r, the maximum 
mass reach for new semi-leptonic physics determined by our QP result 
is A, = 26.3 TeV. Using the coupling g” = 4na, which is typically 
assumed for leptoquarks*® (hypothetical BSM particles with both 
lepton and baryon numbers; see also http://pdglive.lIbl.gov/DataBlock. 
action?node=S056ESP&init=0), equation (4) rules out leptoquark 
masses below 2.3 TeV. 

In a more general case® where the coupling of the new physics to 
the quarks is not restricted to the same 2:1 ratio of u to d quarks as 
in the proton, the mass reach can be expressed as a circle of 
radius (¢/A)? V2 /Gp , about the standard-model origin 


(CSM, CSM) = (—0.1887, 0.3419) in C,q space. This is illustrated in 
Fig. 4a, which also shows the complementary constraints on 
C,q provided by the weak-charge measurements of this experi- 


ment (Q? = —2(2C,,, + C,q)) and the °Cs APV'*!5 result? (QC= 


—2[55(2C,,, + C4) + 78(C,,, + 2C,4)]). The isospin dependence of 
potential BSM physics can be expressed in terms of the projections 
ho = cos, and hd = sin6,, where is the quark-flavour mixing angle, 
the superscripts u and d refer to the quark flavour and the subscript 
refers to the vector piece, as in ref. °. We then obtain the 95%- 
confidence-level bounds from this experiment, which are shown in 
Fig. 4b, together with those from APV*'*!° and from combining those 
two constraints. New physics is ruled out below the curves. Our com- 
bined constraint raises the O,-independent limit for generic new 
semi-leptonic parity-violation BSM physics to 3.5 TeV. 

The low-energy precision frontier continues to offer an exciting 
landscape to search for BSM physics. The results of this experiment are 


(4) 
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consistent with the standard model and place important limits on new 
beyond-standard-model physics. Future experiments propose to pro- 
vide even more precise (and much more challenging) determinations 
of QP at lower Q? (ref. 3!) and of Q< (ref. 3) by taking the techniques 
and lessons learned in this experiment to the next level. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0096-0. 
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METHODS 


Here we describe the formalism connecting the experimentally measured asym- 
metry A¢p to the proton’s weak charge Q?, the experimental method used to deter- 
mine Agp, some measures of the data quality, and the electroweak radiative 
corrections needed to extract the weak mixing angle from Q?. 

Formalism. The Qweak experiment measured the parity-violating asymmetry Aep, 
which is the normalized difference between the elastic-scattering cross-sections () 
of longitudinally polarized electrons with positive (a) and negative (o_) helicity 
from unpolarized protons: 


Ae, = 
2 Gta. (5) 


O,—-6. 


The extended structure of the proton can be represented using form factors. 
Assuming charge symmetry, the single-boson exchange (‘tree-level’) expression 
for the asymmetry can be written in terms of the proton and neutron electromag- 
netic form factors Gf, Gf,, Gg and Gy), the strange electric and magnetic form 
factors G, and G,, and the neutral weak axial form factor Gas”? 


AY + AP + AP 


=A (6) 
Gh) rtGe) 
with 
2 
Ay= mi 
4naJ/2 
AP = QP[e(GB)* + 7(GR)] 
— [eGhGe+7GhGul 


A’ = —eGG; —TGEGy 
AP = —(1 — 4sin’@w)e/GP,GR 
QP = —2(2C,, + Cig) 


where the subscripts v, s and a refer to the vector, strange and axial contributions 
to the asymmetry, respectively, 


and ¢e/=,/7r(1+7)(1—e’) 


a 
1+2(1 + 7)tan?(0/2) 


are kinematic quantities and t= Q’/(4M°), where M is the proton mass. 

By taking the limits 0 + 0 and Q? — 0, equation (6) is reduced** to equation (3). 

Although similar relationships apply for parity-violating elastic scattering from “He 
and quasi-elastic scattering from 7H, these involve different linear combinations 
of C)y and Cg and the various form factors***°. 
Global fit. Using equation (6), a global fit was made to a database that included all 
relevant PVES data up to Q?= 0.63 GeV? c-*: 28 electron—proton scattering results 
obtained by the G0**?’, HAPPEX**41, SAMPLE™, PVA4*** and Qweak® collabora- 
tions, including the present result, two ‘He elastic scattering results (HAPPEX*™*) 
and five °H quasi-elastic scattering results (G0*”, PVA4‘***” and SAMPLE*). 

For the fit, we followed the procedure introduced by Young et al.”, as used in 
refs **. We used six parameters in the fit; namely, C),, Cia and four parameters that 
characterize the strange and axial form factors: the strange radius p,, the strange 
magnetic moment j1; and the magnitudes of Gi and Gy. The parameterizations 
chosen for the strange form factors were Gp = 2 Q’G, and Gy, = Gq, with the 
dipole form factor Gy = (1 — Q”/A’)*, where \= 1 GeV c!. Young et al.” exami- 
ned the consequences of using more elaborate Q’ dependences for Gf and G;, and 
found that they were not required for the data. More recent analyses have come to 
the same conclusion!*“?, LQCD calculations!®” have also found similar shapes for 
Gj,and Gy. The neutron’s axial form factor G, enters the fit through the inclusion 
of 7H data in the database. For the isovector combination G¢'7=) = (GR — GR) /2, 
the dipole form Gg was again adopted, with the normalization factor as the parameter. 
Because the isoscalar combination G¢‘7~ = (GR + G)/2._ is known from the 
theory to be small, it was constrained in the fit to the theoretical value”? 
GxF — _0,0840.26, reducing the effective number of parameters to five. The 
unconstrained isovector combination Gee) presented in Table 1 was con- 
structed from the values of GR and Gi determined in the fit, which were 
GR = —0.59+ 0.34 and G4 = 0.68 + 0.44 with a covariance of —0.0265. 

The electromagnetic form factors Gf, GP, Gp and Gy, were taken from ref. °. 
If, instead, we use any one of the several alternative parameterizations for these 
form factors*!~*?, the fitted result for QP? changes by less than 1%. We have incor- 
porated this range as a systematic uncertainty. 

Each of the experimental asymmetries Ap used in the fit needed to be corrected 
for the one electroweak radiative correction with considerable energy dependence, 
the yZ-box diagram!. Three independent theoretical groups have calculated this 


correction, with results in excellent mutual agreement’®'?**, We have adopted the 
most recent, which was obtained from a data-driven calculation of the vector!” 
(0.0054(4)) and axial-vector!!? (—0.0007(2)) contributions, multiplied by a small 
Q correction!’ (0.978(12)); the numerical values given here are in terms of the 
effect on the reduced asymmetry Aep/Ao. The total correction to our datum is 
0.0046(5), corresponding to a 6.4% + 0.6% correction to Qe. If, instead, we use the 
calculations of either ref. !° or ref. 54, the extracted value of Q? is essentially 
unchanged, and the effect on its uncertainty is negligible. 

The fit was done using linear x? minimization, with a resulting y7/d.0.f.= 1.25 

for 29 degrees of freedom (d.o.f.). 
Apparatus. The Qweak experiment was performed using a custom apparatus 
installed in Hall C at the Thomas Jefferson National Accelerator Facility. A com- 
plete description of the apparatus and critical aspects of the accelerator can be 
found in ref. 4, Here we highlight some of the most important details illustrated 
in Extended Data Fig. 1. 

The electron beam was longitudinally polarized and its helicity was reversed 
at a rate of 960 Hz in a pseudorandom sequence of ‘helicity quartets’ (+ — — +) 
or (— ++ —). The quartet pattern minimized noise due to slow linear drifts, and 
the rapid helicity reversal limited noise due to fluctuations in the target density 
and beam properties. As a test for possible false asymmetries, two methods were 
used to reverse the beam helicity on a slower timescale than the rapid helicity 
reversal. About every 8h, the helicity of the laser beam that was used to generate the 
polarized electron beam was reversed by insertion of a half-wave plate in its path. 
The helicity of the electron beam in the accelerator’s injector region was reversed 
monthly using a double Wier’ spin rotator?’. Non-vanishing correlations between 
the properties of the electron beam (intensity, position, angle and energy) and its 
helicity lead to false contributions to the measured asymmetry. These helicity- 
correlated beam properties were suppressed by carefully setting up the helicity- 
defining optics and active-feedback systems in the polarized injector laser system. 
The electron beam transport line was instrumented to allow the correction of 
residual helicity-correlated beam properties. Beam monitors upstream of the 
experimental apparatus provided continuous, non-invasive measurement of the 
electron beam’s intensity, position, angle and energy. The response of the experi- 
mental apparatus to changes in the beam properties was periodically measured 
using a beam modulation system that generated controlled variations in the beam’s 
position and angle using magnets and in its energy using a radio-frequency accele- 
rating cavity. Finally, the electron beam’s polarization was measured with two inde- 
pendent polarimeters upstream of the Qweak apparatus. 

The experiment usually ran with a 180-;1A beam of 1.16-GeV, ~88% longi- 
tudinally polarized electrons incident on a 34.4-cm-long liquid hydrogen target 
placed in an aluminium cell and maintained at 20K. A set of three lead collima- 
tors restricted the polar scattering angle acceptance to the range 5.8° < @< 11.6° 
with an azimuthal angle coverage of 49% of 2m. A resistive toroidal magnet 
between the target and detectors separated the elastically scattered electrons 
of interest from irreducible backgrounds arising from inelastic and Moller 
scattering. The resulting scattered electrons were detected in eight synthetic- 
quartz Cherenkov detectors, each 200 cm x 18cm x 1.25cm, arranged in 
an azimuthally symmetric pattern about the beam axis. Fourfold azimuthal 
symmetry was important to minimize and characterize the effects of helicity- 
correlated beam properties and residual transverse beam polarization. These 
‘main’ detectors were equipped with 2-cm-thick Pb pre-radiators that amplified 
the electron signal and suppressed low-energy backgrounds. Light was collected 
from the detectors by photomultiplier tubes (PMTs) at each end. The high rates 
of ~800 MHz per detector required a current-mode readout, in which the PMT 
anode current was converted to a voltage that was digitized and integrated 
about every millisecond. Photographs of the apparatus are shown in Extended 
Data Fig. 1. 

The collimator-magnet configuration and carefully designed shielding were 
effective in minimizing reducible sources of background, such as direct line-of- 
sight (neutral) tracks originating in the target and secondary scattering from 
the beampipe. The residual diffuse background was monitored by background 
detectors in the main-detector shield house. These consisted of a complete main 
detector placed in the super-elastic region, a dark box with a bare PMT and a 
dark box with a PMT attached to a light guide. A symmetric array of four smaller 
detectors (‘upstream luminosity monitors’) placed on the upstream face of the 
defining (middle) collimator was effective in monitoring residual backgrounds 
from the tungsten beam collimator that shielded the downstream region from 
small-angle-scattered particles. 

Finally, a tracking system consisting of drift chambers located before and after 
the magnet was deployed periodically to verify the acceptance-weighted kinematic 
distribution and to help study the backgrounds. These measurements were done in 
special run periods at very low beam currents (0.1-200 nA) and using conventional 
individual pulse counting. 
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Data from the experiment'’s short commissioning run (4% of the size of the 

dataset reported here) have been previously published’. Here we report the com- 
bined result from two run periods (referred to as Run 1 and Run 2), each about six 
months in duration. Because improvements were made to the apparatus between 
the two run periods and their beam conditions were different, we report the cor- 
rections and systematic uncertainties for each run period separately. To prevent 
possible biases in the analysis, the main-detector asymmetries were blinded by 
an additive shift in the asymmetry that was different for each run period. When 
the data analysis was complete, the asymmetries were unblinded, revealing the 
results presented here. 
Data analysis. The raw asymmetry Aray was formed from the normalized 
cross-section difference of equation (5) over the sum of the beam-charge norma- 
lized detector PMT signals, summed over the 8 detectors. The measured asymme- 
try Amsr was calculated from A;aw by correcting for a variety of effects that could 
cause false asymmetries: 


Annsr = Ajay + A, “he A “ts Abe + App “ti Abeam an Abjas (7) 


The methods for determining each of these corrections and their uncertainties 
are discussed below. 

Transverse asymmetry correction, A;. A small (~2%) residual transverse component 
of the incident beam polarization resulted in a transverse asymmetry Aj, which is 
driven by a parity-conserving two-photon-exchange amplitude’. To determine 
this correction, we measured the transverse asymmetry using a maximally trans- 
versely polarized incident beam*’, which we used to calculate an upper bound on 
the broken symmetry of the spectrometer-detector system (<1.3%). By combining 
these effects with the measured residual transverse polarization components of 
the beam during data-taking with a nominally longitudinally polarized beam, we 
determined the correction to be A;=0.0+ 1.1 p.p.b. and Ay =0.0 + 0.7 p.p.b. for 
Run 1 and Run 2, respectively. 

Linearity correction, Aj. The nonlinearity of the Cherenkov detector readout 
chain (PMT, low-noise voltage-to-current preamplifier and analogue-to-digital 
converter) was directly measured using a system with light-emitting diodes and 
was found to be 0.7% + 0.5% at signal levels corresponding to those recorded dur- 
ing the experiment. This results in an additive correction Aj= 1.3 + 1.0 p.p.b. and 
Aj=1.2+0.9 p.p.b. for Run 1 and Run 2, respectively. 

Beam current monitor correction, Apem. The beam current was measured non- 
invasively with radio-frequency resonant cavities to allow a precise relative 
comparison of the beam charge in each helicity state. Two such beam current 
monitors (BCMs) were used in Run 1, while three BCMs were used in Run 2 after 
the installation of an additional monitor and improvement of the low-noise digital 
demodulation electronics. The correction Apcm is zero by definition because in 
each period we normalize the integrated detector signals to the average of the 
BCM signals. The systematic uncertainty on this correction is determined from the 
variation in the reported charge asymmetry for the BCMs in each analysis, resulting 
in 6Apem = £4.4 p.p.b. and bApem = +2.1 p.p.b. for Run 1 and Run 2, respectively. 
Beamline background asymmetry correction, App. False asymmetry caused by 
secondary events scattered from the beamline and the tungsten beam collimator 
is referred to as the beamline background asymmetry App. Such events were deter- 
mined to be predominantly due to low-energy neutral particles and contributed a 
small amount to the signal (0.19%) but had a large asymmetry, which is believed 
to be associated with a helicity-dependent intensity or position variation in the 
extended halo around the main accelerated beam. Although their contribution was 
only a small component of the main detector signal, it dominated the asymme- 
try measurement of the background detectors, which were highly correlated (see 
Extended Data Fig. 2b). A direct correlation between the main-detector asymmetry 
from these events and the background asymmetries measured by the background 
detectors was shown by blocking two of the eight openings in the first of the three 
Pb collimators with 5.1-cm-thick tungsten plates (see Extended Data Fig. 2a). To 
correct for this false asymmetry, a correlation factor was extracted (separately for 
Run 1 and Run 2) between the asymmetries of the main detector array and the 
upstream luminosity monitor array, as shown in Extended Data Fig. 2c. The cor- 
relation factor was combined with the measured upstream luminosity monitor 
asymmetry, averaged every 6 min, to correct the main-detector asymmetry in that 
interval. The resulting net corrections for Run 1 and Run 2 were App =3.9+4.5 
p.p.b. and App = —2.4+ 1.1 p.p.b., respectively, where the uncertainty includes con- 
tributions from the statistical error on the determination of the correlation, as well 
as systematic errors extracted by allowing the correlation factor to vary randomly 
within a reasonable range over different time periods™. 

Correction for helicity-correlated beam properties, Apeam. Residual non-vanishing 
correlations in the properties of the electron beam were accounted for through the 
correction Apeam. Five beam properties—the transverse beam position and angle 
(horizontal X, X’; vertical Y,Y’), as well as the energy—were monitored continu- 
ously as described above. The run-averaged helicity-correlated values of these 
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parameters are presented in Extended Data Table 1. Using the measured helicity- 
correlated beam differences Ay; and the sensitivity of the measured asymmetry 
to variations in the beam parameters 0A/0y;,; the corrections for the ith beam 
parameter were combined into the total correction: 


The detector sensitivities 0A /Oy, were measured routinely by varying the beam 
parameters using the beam modulation system described above. This system cre- 
ated small perturbations of the beam parameters about their nominal values with 
a sinusoidal driving function at 125 Hz. Furthermore, an accelerator fast-feedback 
system with an operational frequency range that included 125 Hz was used, driving 
the beam position and angle parameters with a sinusoidal pattern that was 90° out 
of phase with respect to the beam modulation system. The result was that the beam 
parameters were driven in more than one way, allowing for redundant measure- 
ments of the detector sensitivities using different combinations of the driven 
signals. The spread in those results was the dominant component of the systematic 
uncertainty for this correction. Typical values of the sensitivities are shown in 
Extended Data Table 1. For a perfectly symmetric apparatus, the position and angle 
sensitivities would be zero. For this apparatus, the sensitivities show that the hori- 
zontal plane had a larger broken symmetry than the vertical plane. The resulting 
corrections were Apeam = 18.5+4.1 p.p.b. and Apeam = 0.0 + 1.1 p.p.b. for Runs 1 
and 2, respectively. The considerably smaller correction in Run 2 was due to smaller 
position and angle helicity-correlated differences during that period. 
Rescattering bias, Apias. When comparing the measured asymmetry in the two 
PMTs (‘left’ and ‘right’) that read out the signal at each end of the eight main 
detectors, we found a consistent difference of about +300 p.p.b., as shown in 
Extended Fig. 3a. Here, ‘right’ is the beam direction k crossed with the radial 
direction, k x #. This effect is due to the left/right analysing power of the multiple 
scattering of transversely (radially) polarized electrons through the Pb pre- 
radiators of the main detectors. For perfect symmetry, this parity-conserving effect 
cancels when forming the parity-violating asymmetry of interest. Properly account- 
ing for the minor broken symmetries of the as-built apparatus leads to a correction, 
Abjas: referred to as rescattering bias. 

A schematic of the physical model for this effect is shown in Extended Data 
Fig. 3a. Scattered electrons, which are initially fully longitudinally polarized, 
acquire some transverse polarization through precession as they transport through 
the spectrometer’s magnetic field. 

The effect was modelled using a detailed Geant4°? simulation of the transport 
of the detected electrons through the spectrometer and the Pb radiator, including 
electromagnetic showering and Mott scattering. An asymmetry in the distribution 
of electrons penetrating the radiator develops owing to the analysing power of 
low-energy Mott scattering. A possible analysing power for high-energy scattering 
due to non-Born processes was also considered, but reasonable models for those 
processes®! showed a negligible contribution. 

Asymmetries obtained from this simulation (and from a variety of analytic 
effective models that reproduced the key features of the simulation) were combined 
with scattered electron flux distributions and tailored parameterizations of the 
Cherenkov-light yield for each detector to estimate Apia; and its uncertainty. The 
light yield varied strongly with the arrival angle and position of the electron on the 
detector. The light-yield parameterizations were developed to match the observed 
light-yield distribution by tuning the optical parameters in a Geant4”? optical- 
photon transport simulation. The largest systematic uncertainty was associated 
with the optical modelling of the individual as-built detectors. This uncertainty was 
determined from the range of predicted Apjas values, which was obtained by varying 
the optical parameters in the simulation while maintaining reasonable agreement 
with the measured light yield distributions. The predicted Apjas values for each 
detector are shown in Extended Data Fig. 3c. The resulting averaged correction 
and its systematic uncertainty are Apias = 4.3 + 3.0 p.p.b. 

This result was consistent to within 1 p.p.b. with a simpler, independent calcu- 

lation based on a phenomenological approach, which used the measured position 
distributions of the electron flux on the pre-radiators and the measured depend- 
ence of the light seen by each PMT on the track position on the detectors. The 
results obtained using the measured distributions from the experiment’s tracking 
system were compared with those obtained from the simulation. In this approach, 
the effect of the position differences was scaled to match the observed left/right 
asymmetry. The sensitivity to various models for the position dependence of the 
asymmetries was found to be small. 
Determination of Aep. The measured asymmetry Ams: was then corrected for 
incomplete beam polarization, the effects of various background processes, electro- 
magnetic radiative corrections and the finite acceptance of the detector, to obtain 
the fully corrected electron—proton asymmetry Aep, using: 


ep? 
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where Rig, = RicRaetRaccRg?, fi are dilutions to the signal and A; are false or back- 
ground process asymmetries. The components of equation (8) are discussed below. 

(1) Ryc. The electromagnetic radiative correction factor Ry. = 1.010 + 0.005 
accounts for the effect of internal and external bremsstrahlung of the incident 
electron, which can depolarize the electron and modify the momentum transfer 
Q’ at the scattering vertex. R,, was determined using a Geant3* simulation by 
comparing results with and without bremsstrahlung enabled in the simulation. 

(2) Raet- The Cherenkov detector analogue response (that is, the summed optical 
signal detected by the two PMTs attached at each end of each detector) varied as 
a function of the arrival location of the scattered electron on the detector. The 
magnetic optics of the spectrometer also caused a correlation of the electron arrival 
location with Q’ and, therefore, with the asymmetry. The correlation between the 
detector’s analogue response and the Q? value of each track was determined using 
the tracking-system drift chambers, and the resulting correction to the measured 
asymmetry was R¢et = 0.9895 + 0.0021. 

(3) Race. Owing to the finite acceptance of the spectrometer and the effect of 
radiative energy losses, Ams: represents an average over a range of Q? values. 
Because the asymmetry varies strongly with Q’, we used a simulation to correct 
the averaged asymmetry (A(Q’)) to the asymmetry that would arise from point 
scattering at the central (Q’), A((Q’)), using: 


_ A((Q’)) 


= (4(Q) = 0.977 + 0.002 


acc 


(4) Rg. The central (Q’) for the experiment was determined from a Geant4 
simulation that was benchmarked with measurements from the tracking system. 
The (Q?) value was not identical for Run 1 and Run 2 because of minor differences 
in the beam energy, target location and magnetic field of the spectrometer, with 
Run 1 having a higher (Q”). The global fit of A.p versus Q? (see Fig. 2) was used 
to determine the sensitivity of the asymmetry to small changes in (Q”). Run 2 was 
chosen as the reference for (Q”) , and the Run 1 asymmetry was scaled from its 
measured (Q’) value using R42 = 0.9928 (Rg? = 1 for Run 2, by definition). The 
determination of the central (Q’) value has a 0.45% relative uncertainty, domi- 
nated by the uncertainties on the locations of the collimator, target and main detec- 
tor and that of the beam energy determination. To simplify the global fitting, we 
decided to quote (Q”) as exact and used the sensitivity OA / OQ? to determine 
an effective error contribution to the asymmetry. This error on R92 was 0.0055 for 
both run periods. The acceptance-averaged Q’, scattering angle and incident elec- 
tron energy were (Q’) = 0.0248 GeV? c 7, (6) =7.90° and (Ey) = 1.149 GeV, 
respectively. 

(5) Beam polarization, P. To achieve a reliable determination of the beam pola- 
rization (P) at <1% accuracy, two different techniques with precisely calculated 
analysing powers were employed for redundancy. An existing Moller polarimeter” 
in experimental Hall C was used invasively 2-3 times a week. It measured the 
parity-conserving cross-section asymmetry in the scattering of polarized beam 
electrons from polarized electrons in an iron foil target at low (typically smaller 
than or equal to ~2 1A) beam currents. A newly installed, non-invasive Compton 
polarimeter’ monitored the beam polarization continuously at the full-production 
beam current of 180 1A. This device measured the parity-conserving asymmetry in 
the scattering of beam electrons from circularly polarized laser photons. For each 
run period, the averaged beam-polarization-corrected asymmetry was computed 
in two ways: by correcting each ~6-min period of data-taking for the polarization 
measured during that interval, and by using an overall average beam polariza- 
tion for the whole run period. Because the two methods gave the same result 
to a small fraction of the quoted uncertainty, for simplicity the results obtained 
using the overall average beam polarization are quoted here. The overall average 
beam polarizations for the two running periods were Pruni = (87.66 + 1.05)% and 
Prun2 = (88.71 + 0.55)%, where the uncertainties are predominantly systematic. For 
Run 1, the uncertainty was larger for two reasons: the Compton polarimeter was 
still being commissioned, so it was not used for this determination, and the Moller 
uncertainty was larger than usual owing to the need to correct for the effects of 
an intermittent short circuit in one of the quadrupole magnets of the polarimeter. 
For Run 2, both polarimeters were fully functional and agreed well with each 
other, as shown in Extended Data Fig. 4. A dedicated direct comparison of the 
Moller and Compton polarimeters under identical beam conditions at low beam 
current was also performed. The two techniques agreed within the uncertainties 
for that measurement, dP/P = 1% and dP/P=0.73% for the Compton and Moller 
polarimeters, respectively®. 

Physics backgrounds (target windows). The entrance (0.11-mm-thick) and exit 
(0.13-mm-thick) windows of the hydrogen target were made of aluminium 7075 
alloy. Electrons scattered from these windows caused the dominant background 


process (f,A; = 37 p.p.b. for Run 1 and f,A; = 38 p.p.b. for Run 2). The parity- 
violating elastic asymmetry for electrons scattered from aluminium is observed 
to be nearly an order of magnitude larger than that for scattering from hydrogen 
owing to the much larger weak charge of the aluminium nucleus. Therefore, 
even the small fraction of the detected yield arising from the windows required a 
substantial correction to the measured asymmetry. The aluminium asymmetry was 
determined in dedicated data-taking runs using an aluminium target made from 
the same block of material as the target windows but with a thickness (3.7mm) 
to match the radiation length of the hydrogen target. The ranges of scattering 
angles accepted from the upstream and downstream windows were different, which 
required a small kinematic correction to the measured alloy asymmetry to yield 
the asymmetry from the target windows A; = 1.515 +0.077 p.p.m. (see Extended 
Data Fig. 5). The uncertainty is dominated by statistics but includes systematic 
uncertainties arising from the kinematic correction, among others. The fraction 
of the measured yield arising from the target windows, f, = (2.471 + 0.056)% (Run 
1) and f, = (2.516 +0.059)% (Run 2), was measured with a low beam current on 
an evacuated target cell, and a simulation was used to correct for radiative effects 
due to the liquid hydrogen. 

Physics backgrounds (beamline background dilution). As described above (see 
‘Beamline-background asymmetry correction, App), a component of the back- 
ground came from scattering sources in the beamline. The dilution from this 
source was measured” to be fp = (0.193 + 0.064)% by blocking two of the eight 
openings in the first collimator to eliminate the electron elastic-scattering signal 
from the target. The uncertainty accounts for variation in the f, value between 
detectors and under different beam conditions. 

Physics backgrounds (neutral background). A possible contribution from 
low-energy neutral backgrounds arising from secondary interactions of the pri- 
mary scattered electrons in the collimators and magnet structure was bounded® 
to f; < 0.30% by subtracting f; from the total neutral background measured by the 
main detector after vetoing charged particles using thin scintillators. The asym- 
metry for this background was estimated® from a Geant4® simulation of the con- 
tributing processes to be A3= —0.39 + 0.16 p.p.m., with the dominant contribution 
coming from secondary interactions of electrons elastically scattered from protons. 
Physics backgrounds (inelastic electrons). An unavoidable background com- 
ponent comes from inelastically scattered electrons that have excited the target 
protons to the A(1232) resonance, a small fraction of which enter the acceptance 
of the spectrometer. The fraction of the yield from inelastically scattered electrons 
was estimated using simulation to be f,= (1.82 + 0.37) x 10-4. To determine the 
correction to the asymmetry owing to these events, we measured the parity-violating 
asymmetry in the energy region near the A(1232) resonance during a dedicated 
study with a reduced spectrometer current to concentrate these electrons on the 
detectors. Scaling this asymmetry up to the Q? value of the elastic peak gives an 
inelastic asymmetry of Ay= —3.0+ 1.0 p.p.m. at the elastic peak. Backgrounds 
from 7 particles and other hadrons were negligible. 

Summary of corrections to the asymmetry and their uncertainties. The separate 
and combined measured asymmetries for Runs 1 and 2 are presented in Extended 
Data Table 2. Also shown is the breakdown of these uncertainties in terms of 
descending fractional significance. Extended Data Table 3 presents the numerical 
values of the raw asymmetry, A;aw, along with all systematic and acceptance cor- 
rection factors used to derive the observed measured asymmetry, Ams, and physics 
asymmetries, Aep, using equations (7) and (8). Correlated uncertainties, accounted 
for when the two runs were combined, are also listed. 

The fully corrected asymmetry is Agp = —223.5 + 15.0 (statistical) - 10.1 (sys- 
tematic) p.p.b. for Run 1 and A.p = —227.2 + 8.3 (statistical) + 5.6 (systematic) 
p.p.b. for Run 2. The combined asymmetry is Agp = —226.5 + 7.3 (statistical) + 5.8 
(systematic) p.p.b. 

Data quality. Two representative tests of the consistency and the quality of the 
corrected asymmetries are presented below. 

Null result. The Qweak experiment employed signal phase locking on three inde- 
pendent techniques of polarized-beam helicity reversal that were used to isolate 
the scattering asymmetry. These were the rapid (960 Hz) reversal, the insertion of 
a half-wave plate in the source laser optical path at 8-h intervals and the Wien- 
filter reversal at monthly intervals. The half-wave plate technique is based on a 
mechanical action and thus is unable to induce any false asymmetries electri- 
cally or magnetically. The Wien-filter reversal rejects false asymmetries induced 
by beam-size (or focus) modulation. By constructing an out-of-phase, or null, 
asymmetry Anu from the latter two slow-reversal techniques, we can determine 
whether there are unaccounted-for false asymmetries. The full dataset-weighted 
null is Any =—1.75+6.51 p.p.b., which is consistent with zero, as expected. 
Asymmetry measurements. A plot of the observed main-detector asymmetry versus 
the Wien-filter configuration is shown in Extended Data Fig. 6. Run 1 and Run 
2 were separated by a six-month accelerator shutdown, during which numerous 
modifications were made to the experimental apparatus and accelerator. These 
included upgrading the electronics associated with the beam current measurement 
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and increasing the number of associated BCMs. An electrically isolated helicity 
phase-locked beam-position stabilization system was enabled in the 6-MeV region 
of the injector during Run 2. This greatly improved the helicity-correlated stability 
of the beam delivered to the experiment. There were also radio-frequency- 
associated electronics and superconducting-cavity upgrades performed within 
the accelerator, unrelated to this experiment, as well as extensive upgrades to both 
beam polarimeters. As a consequence, the contributions of many beam-related 
systematic effects meaningfully changed between the two run periods. However, 
the resulting fully corrected physics asymmetries of the two run periods agree 
well. This is evidence that within the experiment’s precision, the observed set of 
identified systematic effects is complete and their associated correction algorithms 
behave in a deterministic manner. 

Electroweak radiative corrections and extraction of sin’?@w. The weak mixing 
angle is obtained from the proton’s weak charge, taking into account energy- 
independent electroweak radiative corrections using! 


P —Oww —Ozz —O,z(0 
Q,, Www ZZ zi LAL 
(p+ A,) 


4sin’0y(0) = 1 (9) 


Here, Oww and Qzz are the WW and ZZ boson box-diagram radiative correc- 
tions, p is the renormalization of the ratio of neutral-current (Z°) to charged-cur- 
rent (W+) interactions at low energy, A, is the electron vertex correction to the 
axial Zee interaction, Ai is the electron anapole moment and O.z(0) refers to 
the remaining energy-independent piece of the yZ-box diagram (the energy- 
dependent piece was discussed in ‘Global fit’). 

The accidental suppression of the proton’s weak charge in the standard model 
means that QP is unusually sensitive to sin’Ow. To see this quantitatively, our deter- 
mination of Q? to a precision of 6.25% results in a sin’Ow precision of 0.46%. By 
contrast, the higher relative precision (0.59%) of the weak charge measurement'*!° 
on !3Cs, which is dominated by the neutron’s weak charge, which is not suppressed 
in the standard model, leads to a sin?0w uncertainty of 0.81%, almost twice the 
uncertainty of our result. 

The radiative corrections appearing in equation (9) are described in ref. ! but 
were re-evaluated using the more recent input found in refs *''. Although QP in 
equation (6) is defined in the Thomson limit (Q<m,, where m, is the electron 
mass) at Q=0, it is determined from data in the scattering limit (Q>>m-). We 
chose to use radiative corrections in the Thomson limit from ref. 1 to calculate 
sin*@y(0) from QP. Extended Data Table 4 lists these corrections’*')% 

To compare our result for sin’@y(0), given by equation (9), with that of refs 
we added a correction of 2a/(97), consistent with the definition of ref. ©, as dis- 
cussed in ref, 2°, The result is sin?@w(Q= 0) = 0.2384 0.0011 in the MS scheme’. 
Shifting this to the Q=0.158 GeV of the Qweak experiment (a correction of 
—0.00012), our result is sin?Ay(Q= 0.158 GeV) = 0.2382 £0.0111. 

Data availability. The 200 TB of raw data acquired in this work are stored at the 
Jefferson Laboratory data silo. The derived data supporting the findings of this 
study are available from the corresponding authors upon request. 

Code availability. The software used for data management and analysis consisted 
of commercial and publicly available codes, plus experiment-specific software. 
Jefferson Laboratory’s data management plan is available at https://scicomp.jlab. 
org/DataManagementPlan.pdf. The experiment-specific software is stored in a 
version management system (SVN & GIT) and archived at the data storage facilities 
of Jefferson Laboratory, in accordance with existing US regulations. Requests for 
this material should be addressed to the corresponding authors. 
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Extended Data Fig. 1 | Apparatus. a, Schematic of critical accelerator 
components and the Qweax apparatus*. The electron beam is generated 

at the photocathode, accelerated by the Continuous Electron Beam 
Accelerator Facility (CEBAF) and sent to experimental Hall C, where it is 
monitored by beam position monitors and beam current monitors. The 
insertable half-wave plate (IHWP) provides slow reversal of the electron 


beam helicity. The data acquisition system records the data. b, Computer- 
aided design drawing of the experimental apparatus. c, The Qweak 
apparatus, before the final shielding configuration was installed. d, Interior 
of the hut shielding the detectors, showing two of the Cherenkov detectors 
(right) and a pair of tracking chambers (left). 
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Extended Data Fig. 2 | Beamline background. Determination of 
App, the false asymmetry arising from beamline background events. 
Uncertainties are 1 s.d. a, Correlation of the main detector asymmetry to 
that of the upstream luminosity monitors, measured when the signal from 
elastically scattered electrons in the main detectors was blocked at the first 
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collimator. b, Correlation of asymmetries from the upstream luminosity 
monitors with one of the other background detectors (a bare PMT located 
in the detector shield house). c, Correlation of the unblocked main 
detector asymmetry to that of the upstream luminosity monitor for Run 2. 
Our App determination was based on this slope. 
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Extended Data Fig. 3 | Rescattering bias. a, Schematic illustrating the 
precession of longitudinally polarized electrons through the spectrometer 
magnet, generating sizeable transverse spin components upon arrival at 
the detector array (spin directions indicated by red and blue arrows for the 
two electron helicity states). An end-view of the detector array, indicating 


the right (R) and left (L) PMT positions, is shown on the left. b, Difference 
between the asymmetry measured by the two (R and L) PMT tubes versus 
the detector number (Run 2 data). c, Calculated rescattering bias Apias 
versus detector number, with the eight-detector-averaged value shown by 
the red lines. Uncertainties (1s.d.) are systematic. 
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Extended Data Fig. 4 | Electron beam polarization. Measurements polarization values used in the evaluation of the parity-violating 

from the Compton (closed blue circles) and Moller (open red squares) asymmetry Aep. The time dependence of the reported polarization is 
polarimeters during Run 2. Inner error bars denote statistical uncertainties driven primarily by the continuous Compton measurements, with a small- 
and outer error bars show the statistical and point-to-point systematic scale correction (0.21%, not included in this figure) determined from 
uncertainties added in quadrature. Normalization, or scale-type, an uncertainty-weighted global comparison of the Compton and Moller 
uncertainties are shown by the solid blue (Compton) and red (Moller) polarimeters. 


bands. All uncertainties are 1 s.d. The yellow band shows the derived 
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Aluminum Asymmetry [ppm] 
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Data Set 
Extended Data Fig. 5 | Asymmetry from aluminium. Parity-violating generated through (g. — 2) precession (g., electron gyromagnetic ratio) 
asymmetry from the aluminium alloy target versus the dataset number. via a modified accelerator configuration during Wien 6 is indicated. The 
All uncertainties are 1 s.d. The labels ‘IN’ and ‘OUT’ refer to the state of combinations OUT-R and IN-L with no (g. — 2) spin flip reveal the 
the insertable half-wave plate at the electron source, which generated physical sign of the asymmetry. Solid lines represent the time-averaged 
a 180° flip of the electron spin when IN. The subscripts denote the values, and the horizontal dashed line indicates zero asymmetry. The 
setting of the Wien filter, with L and R corresponding to the presence vertical dashed lines delineate particular data subsets with a given Wien 
and absence, respectively, of an additional 180° rotation of the spin filter setting. 


direction of the electron beam. A period in which a further 180° flip was 
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Extended Data Fig. 6 | Asymmetry from the proton. Observed parity- configuration is indicated. The combinations OUT-R and IN-L with 
violating asymmetry Ag, after all corrections, versus the dataset number no (ge — 2) flip reveal the physical sign of the asymmetry. Solid lines 
(acquired in the double-Wien-filter configuration). The Wien filter represent the time-averaged values and the dashed line indicates 
reversed the beam helicity at approximately monthly intervals. The zero asymmetry. The uncertainties (1 s.d.) shown are those of the 
subscripts denote the setting of the Wien filter as L or R, corresponding corresponding Ams; values (see text) only—that is, they do not include 
to the presence or absence, respectively, of a 180° rotation of the spin time-independent uncertainties—so as to illustrate the time stability of 
direction of the electron beam. IN and OUT refer to the state of the the results. The weighted mean and P-value of the upper OUT-L and 
insertable half-wave plate at the electron source, generating an additional IN-R data are 226.9 + 10.2, P=0.59 (upper solid line), respectively. For 
180° flip of the spin when IN. A period in which a further 180° flip the opposite combination, OUT-R and IN-L, we find a weighted mean of 
was generated through (g. — 2) precession via a modified accelerator —226.1+ 10.5 and P= 0.36 (lower solid line). 
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Extended Data Table 1 | Helicity-correlated beam parameter differences and sensitivities 
Beam Parameter Run 1 Ay; Run 2 Ay; 
x —3.0 0.1 nm =—2.9 20.1 nm 


Typical 0A/Ox; 
—2 ppb/nm 


XxX! —0.30 + 0.01 nrad|—0.07 + 0.01 nrad| 50 ppb/nrad 


< 0.2 ppb/nm 
—0.04+0.01 nrad| < 3 ppb/nrad 


—6 ppb/ppb 


A —7.5+0.1 nm 0.8+0.1 nm 
yy —0.07 + 0.01 nrad 


Energy —1.69 + 0.01 ppb | —0.12 + 0.01 ppb 


The beam parameter differences and typical detector sensitivities for the five meas: 


ured beam parameters for Run 1 and Run 2. Uncertainties are 1s.d. 
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Extended Data Table 2 | Asymmetries and their corrections 
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Run 1 
Run 2 


(ppb) 


Period Asymmetry|Stat. Unc./Syst. Unc. 
(ppb) 


(ppb) 


Tot. Uncertainty 
(ppb) 


Run 1 and 2 combined 
with correlations 


Quantity 


BCM Normalization: Agcy 
Beamline Background: App 
Beam Asymmetries: Apeam 
Rescattering bias: Apjias 
Beam Polarization: P 
Target windows: A; 
Kinematics: Rg 

Total of others 

Combined in quadrature 


Run 1 


error (ppb) 


d.1 
D.1 
A.7 
3.4 
22 
1.9 
2 
2.9 
[0.1 


Run 1 


25% 
25% 
22% 
11% 
5% 
4% 
2% 
6% 


Run 2 Run 2 
fractional|error (ppb) | fractional 


Top, corrected asymmetries Ae, for the Run 1 and Run 2 datasets, and the combined value, with their statistical (Stat. Unc.), systematic (Syst. Unc.) and total (Tot.) uncertainties (1 s.d.), in parts per 
billion (ppb). Bottom, fractional quadrature contributions (o//o%o1)* to the systematic uncertainty (1 s.d.) on Aep for Run 1 and Run 2. Only error sources with fractional contribution > 5% in one of the 
runs are shown. 
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Extended Data Table 3 | Raw asymmetries and their corrections 
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Quantity Run 1 Run 2 Correlation 
Araw —192.7+13.2 ppb |—170.7+7.3 ppb — 
Ar 0O+1.1 ppb 0+0.7 ppb 0 
Ay 1.3 2 1.0 ppb 1.2+0.9 ppb 1 
Apcm 0+ 4.4 ppb 0+ 2.1 ppb 0.67 
App 3.9 +4.5 ppb —2.4+1.1 ppb 0 
Apeamn 18.5 + 4.1 ppb 0.0+1.1 ppb 0 
Aijing 4.3+ 3.0 ppb 4.3 + 3.0 ppb 1 
Amsr —164.6 + 15.5 ppb | —167.5 + 8.4 ppb — 
| 87.66 + 1.05 % 88.71 + 0.55 % 0.19 
fi 2.471 + 0.056 % 2.516 + 0.059 % 1 
Ay 1.514 + 0.077 ppm | 1.515 + 0.077 ppm 1 
fr 0.193 + 0.064 % 0.193 + 0.064 % it 
fs 0.12 + 0.20 % 0.06 + 0.12 % 1 
A3 —0.39 £0.16 ppm | —0.39 + 0.16ppm 1 
fa 0.018 + 0.004 % 0.018 + 0.004 % 1 
A4 —3.0+1.0 ppm —3.0 + 1.0ppm 1 
Rre 1.010 + 0.005 1.010 + 0.005 1 
Ret 0.9895 + 0.0021 0.9895 + 0.0021 1 
hee 0.977 + 0.002 0.977 + 0.002 1 
Rage 0.9928 + 0.0055 1.0 + 0.0055 1 
Reot 0.9693 + 0.0080 0.9764 + 0.0080 1 
eee ate — 1 


d. 


Extended Data Table 4 | Radiative corrections 
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Term Expression Value|Reference 
Pvc 1+A, 1.00066] 1, 2 
de. —a/2Qn —0.001161) 1,2 
At — (1 — 48) [In (48) +3] 0.001411] 1,2 
a =o(Mz) _ 1/127.95] 1, 2 
8? —1-—@ = sin’ Ow(Mz) 0.23129} 1,2 
a,(M?,) - 0.12072) 67 
Oww ag [245 (1 — i) 0.01831] 1, 2 
Clzz | aeSq [9/4 - 58] (1 — 45? + 857) (1- 2@2))) 0.00185) 1, 2 
Llyz axial-vector hadron piece of Oz: Fe OY, 0.0044 11 


Numerical values used for the electroweak radiative corrections in equation (9). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


1 oa Ie en 


https://doi.org/10.1038/s41586-018-0085-3 


Challenging local realism with human choices 


The BIG Bell Test Collaboration* 


A Bell test is a randomized trial that compares experimental 
observations against the philosophical worldview of local realism!, 
in which the properties of the physical world are independent of our 
observation of them and no signal travels faster than light. A Bell test 
requires spatially distributed entanglement, fast and high-efficiency 
detection and unpredictable measurement settings**. Although 
technology can satisfy the first two of these requirements* ’, the use 
of physical devices to choose settings in a Bell test involves making 
assumptions about the physics that one aims to test. Bell himself 
noted this weakness in using physical setting choices and argued that 
human ‘free will could be used rigorously to ensure unpredictability 
in Bell tests®. Here we report a set of local-realism tests using human 
choices, which avoids assumptions about predictability in physics. 
We recruited about 100,000 human participants to play an online 
video game that incentivizes fast, sustained input of unpredictable 
selections and illustrates Bell-test methodology’. The participants 
generated 97,347,490 binary choices, which were directed via a 
scalable web platform to 12 laboratories on five continents, where 
13 experiments tested local realism using photons™®, single atoms’, 
atomic ensembles!° and superconducting devices!!. Over a 12-hour 
period on 30 November 2016, participants worldwide provided a 
sustained data flow of over 1,000 bits per second to the experiments, 
which used different human-generated data to choose each 
measurement setting. The observed correlations strongly contradict 
local realism and other realistic positions in bipartite and tripartite!” 
scenarios. Project outcomes include closing the ‘freedom-of-choice 
loophole’ (the possibility that the setting choices are influenced by 
‘hidden variables’ to correlate with the particle properties!*), the 
utilization of video-game methods" for rapid collection of human- 
generated randomness, and the use of networking techniques for 
global participation in experimental science. 

Bell tests, like Darwin’s studies of finches and Galileo’s observations 
of the moons of Jupiter, use empirical methods to address questions 
previously accessible only by other means, for example, by philosophy 
or theology’®. Local realism—that is, realism plus relativistic limits 
on causation—was debated by Einstein and Bohr using metaphysical 
arguments and recently has been rejected by Bell tests*” that closed 
all technical ‘loopholes. For example, the ‘detection-efficiency loop- 
hole’ describes the possibility that the observed statistics are inaccurate 
owing to selection bias, and is closed by using high-efficiency detection 
and statistical methods that include all trials in the analysis. Recent 
work on device-independent quantum information!® shows how Bell 
inequality violation (BIV) can also challenge causal determinism”, 
another topic that was formerly accessible only by metaphysics!*. 
Central to both applications is the use of free variables to choose 
measurements: in the words of Aaronson!’, “Assuming no preferred 
reference frames or closed timelike curves, if Alice and Bob have 
genuine ‘freedom in deciding how to measure entangled particles, then 
the particles must also have ‘freedom in deciding how to respond to 
the measurements’. 

Previous Bell tests used physical devices”*”! to ‘decide’ for Alice and 
Bob, and thus demonstrated only a relation among physical processes: if 
some processes are ‘free’ in the required sense (see Methods, ‘Freedom’ 
in Bell tests), then other processes are similarly ‘free’ In the language of 


strong Bell tests, this conditional relation leaves open the freedom-of- 
choice loophole (FOCL), which describes the possibility that ‘hidden 
variables’ influence the setting choices. Because we cannot guarantee 
such freedom within local realism, the tests must assume physical 
indeterminacy in the hidden-variable theory”. Laboratory methods 
can tighten, but never close, this loophole?-®. 

Gallicchio, Friedman and Kaiser”” have proposed choosing set- 
tings by observing cosmic sources at the edge of the visible Universe. 
A BIV under such conditions could only be explained within local 
realism if events across history conspired to produce the measured 
outcomes”**, Bell himself argued that human choices could be con- 
sidered ‘free variables’ in a Bell test® (see Methods, John Stewart Bell 
on ‘free variables’), and noted the impracticality of using humans 
with 1970s’ technologies. Here we implement Bell’s idea, using mod- 
ern crowd-sourcing, networking and gamification’* techniques. In 
this BIG Bell Test (BBT), Alice and Bob of Aaronson’s formulation 
are real people. Assuming no faster-than-light communication, such 
experiments can prove the conditional premise that if human will is 
free, there are physical events (the measurement outcomes in the Bell 
tests) that are intrinsically random, that is, impossible to predict?° ‘ 
We note that although this argument in no way uses the theory of 
quantum mechanics, it arrives at one of the theory’s most profound 
claims. Intrinsic randomness supported by a BIV is central to device- 
independent quantum technologies!©”°. 

It is perhaps surprising that human choices, which are known to 
contain statistical regularities’’, are suitably unpredictable for a Bell 
test. Recent works on the statistical analysis of Bell tests*”®° show that 
sequence randomness—that is, the absence of patterns and correlations 
in the sequence of choices—is not, per se, a requirement for the rejec- 
tion of local realism. Rather, statistical independence of choices from 
the hidden variables that describe possible measurement outcomes is 
required (see Methods, ‘Freedom in Bell tests). This independence 
can fail in different ways, which are categorized into named loopholes. 
The ‘locality loophole’ describes the possibility that a choice at one 
station could influence a measurement result at the other station. The 
term ‘locality’ reflects one way of blocking this possibility, by space-like 
separation of the choice and measurement events (see Methods, Use 
of ‘freedom-of-choice loophole’ and ‘locality loophole’ in this work). 

Patterns strongly affect statistical strength in experiments that aim to 
close the locality loophole by space-like separation—they allow current 
choices to be predicted from earlier choices, which have had more time 
to reach the distant measurement. As described below, the BBT tightens 
the locality loophole by using many independent experiments instead 
of space-like separation. Furthermore, the human capacity for free 
choice removes the need for assumptions about physical indetermin- 
ism, allowing the FOCL to be closed. Thus, although human choices 
show imperfect sequence randomness, they nonetheless enable a strong 
rejection of local realism with the BBT strategy. 

A major obstacle to a Bell test with humans has been the difficulty in 
generating enough choices for a statistically significant test. A person 
can generate roughly three random bits per second, while a strong 
test may require millions of setting choices in a time span of minutes 
to hours, depending on the speed and stability of the experiment. To 
achieve such rates, we crowd-sourced the basis choices, recruiting about 


*A list of authors and their affiliations appears at the end of the paper. 
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Fig. 1 | Structure of the BBT. a, Human participants, or Bellsters, enter 
Os and 1s in an online video game that incentivizes sustained generation 
of unpredictable input. Image: ICFO/Maria Pascual (Kaitos Games). 

b, Experiments use Bellster-generated bits to control measurement- 
defining elements, such as wave-plates for photons or microwave pulses 
for matter qubits. Shown is a micrograph of superconducting qubits used 
in experiment (7), with the measured Clauser-Horne-Shimony-Holt 
Bell parameter S mid-way through the BBT. c, A cloud-based networking 


| 
fle 


100,000 participants, the ‘Bellsters, over the course of the project. Each 
choice made by a participant, encoded as a bit (0 or 1), was entered in 
an internet-connected device, such as the participant’s mobile phone. 
Servers relayed these bits to the 13 experiments (see Fig. 1), which used 
them to determine individual settings without re-use (except experi- 
ment (2)). To encourage participants to contribute a larger number 
of more unpredictable bits, the input was collected in the context of 
a video game, The BIG Bell Quest (available at https://museum.the- 
bigbelltest.org/quest/), implemented in JavaScript to run directly in a 
device's web browser. 

The BIG Bell Quest was designed to reward sustained, high-rate 
input of unpredictable bits, while being engaging and informative 
(see Methods, Gamification). An interactive explanation first describes 
quantum nonlocality and the role played by participants and experi- 
menters in the BBT. The player is then tasked with entering a given 
number of unpredictable bits within a limited time. A machine learning 
algorithm (MLA) attempts to predict each input bit by modelling the 
user’s input as a Markov process and updating the model parameters 
using reinforcement learning (see Methods, Prediction engine). Scoring 
and level completion reflect the degree to which the MLA predicts 
the player’s input, motivating players to consider their own predict- 
ability and take conscious steps to reduce it. However, the MLA does 
not act as a filter, and all input is passed to the experiments. Bellsters 
input showed unsurprising deviations from ideal randomness”; for 
example, P(0) % 0.5237 (bias towards 0), whereas adjacent bits show 
P(01) + P(10) + 0.6406 (excess of alternation). 

Modern video-game elements were incorporated to boost engage- 
ment (animation and sound), encourage persistent play (progressive 
levels, ‘power-ups, ‘boss battles’ and leaderboards) and to recruit new 
players (group formation and postings to social networks). Different 
level scenarios illustrate the key elements of the BBT (human input, 
global networking and measurements on quantum systems), and boss 
battles against the Oracle (see Methods) convey the conceptual chal- 
lenge of unpredictability. Level completion is rewarded with (i) a report 
on how many bits from that level were used in each experiment run- 
ning at that time, (ii) a ‘curious fact’ about statistics, Bell tests or the 
various experiments and, if the participant is lucky, (iii) one of several 
videos recorded in the participating laboratories, explaining the exper- 
iments. The game and BBT website are available in Chinese, English, 
Spanish, French, German, Italian and Catalan, making them accessible 
to roughly three billion speakers of these languages. 

To synchronize participant activity with experimental operation, the 
Bell tests were scheduled to take place on a single day, Wednesday 30 
November 2016. The date was chosen so that most schools worldwide 
would be in session and to avoid competing media events, such as the 


LETTER 


Database 
30 November data 


Back-up bucket 


RS 
= 
soa rr — 


system integrates the activities shown in a and b, serving game elements to 
Bellsters, distributing input bits to connected laboratories and providing 
in-game feedback about the experimental use of the player’s input. 
Through this system, Bellsters are given direct—although brief—control 
of the experimental apparatus, so that each measurement setting is 
determined by a single human choice, which is traceable to a given user 
and time of entry (see Methods, Networking strategy and architecture). 


US presidential election. Participants were recruited through a variety 
of channels, including traditional and social media, as well as school 
and science museum outreach programmes, with each partner insti- 
tution handling recruitment in their familiar geographical regions and 
languages. The media campaign focused on the nature of the experi- 
ment and the need for human participants. The press often communi- 
cated this with headlines such as “Quantum theory needs your help” 
(China Daily). A first, small campaign in early October 2016 began 
spreading the story by word of mouth and a second, large campaign 
on 29-30 November of that year was made to attract a wide partici- 
pant base. The media campaign generated at least 230 headlines in the 
printed and online press, radio and television. 

The data networking architecture of the BBT, shown in Fig. 1, 
includes elements of instant messaging and online gaming and is 
designed to efficiently serve a fluctuating number of simultaneous users 
that is not known in advance and could range from 10 to 100,000. 
A gaming component handles the BBT website, participant account 
management, delivery of the game code (JavaScript and video), score 
records and leaderboards. In parallel, a messaging component handles 
data conditioning, streaming to experiments and reporting of parti- 
cipant choices generated via the game. Horizontal scaling is used in 
both components; participants do not connect to the servers directly, 
but connect to dynamic load balancers that spread the input among a 
pool of servers that are dynamically scaled in response to the load. The 
timing of input bits (but not their values) was used to identify robot 
participants and remove their input from the data stream, although 
game operation was unchanged to avoid alerting the robots’ masters. 
A single, laboratory-side server received data from the participant-side 
servers, concatenated the user input and streamed it to the laboratories 
at laboratory-defined rates (see Methods, Networking strategy and 
architecture). 

According to global time zoning, 30 November defines a 51-h win- 
dow, from 0:00 utc (coordinated universal time) plus 14h (for exam- 
ple, Samoa) to 23:59 uTc minus 12h (for example, Midway Island). 
Nevertheless, most participants contributed during a 24-h window cen- 
tred on 18:00 utc. The recruitment of participants was geographically 
uneven, with a notable failure to recruit large numbers of participants 
from Africa. Despite this, the latitude zones of Asia—Oceania, Europe- 
Africa and the Americas had comparable participation, which proved 
important for the experiment. As shown in Fig. 2, input from any single 
region dropped to low values during the local early morning but was 
compensated by high input from other regions, resulting in a sustained 
high global bitrate. Over the 12-h period from 09:00 utc to 21:00 uTc 
on 30 November 2016, the input exceeded 10? bits per second, allow- 
ing the majority of the experiments to run at their full speeds. Several 
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Fig. 2 | Geography and timing of the BBT. a, Locations of the 13 BBT 
experiments, ordered from east to west. The index numbers label the 
experiments, which are summarized in Table 1. Shading shows total 
sessions by country. Eight sessions from Antarctica are not shown. Map 
created by G. Colangelo using data from OpenStreetMaps, rendered 

in Wolfram Mathematica. b, Temporal evolution of the project. The 

top graph shows the number of live sessions versus time for different- 
continent groups, which exhibits a large drop in the local early morning 
in each region. The spike in the participation of the Asian group around 
11:00 utc coincides with a live-streamed event in Barcelona, hosted by 


experiments posted their results live on social networks. Owing to their 
high speeds, experiments (12) and (13) accumulated participants’ input 
to use in short bursts. As determined by separate tests, for example of 
interference visibility, experiments (9) and (12) were not in condition 
to observe a BIV on the day of the event; they reported later results 
using stored input. 

Because the Earth has a diameter of only 43 light milliseconds, 
human choices are too slow to be space-like separated from the meas- 
urements. This leaves open the locality loophole regarding the influ- 
ence of choices on remote detection. Influence of Alice’s measurement 
setting on Bob’s detection (and vice versa) is nonetheless excluded by 
space-like separation in experiments (3) and (13) (see Supplementary 
Information). To tighten the locality loophole, we employ a strategy 
that we call the BIG test; namely, the use of many simultaneous Bell 
tests in widely separated locations and using different physical systems, 
with each experiment’s apparatus constructed and operated by different 
experimental teams. The only hidden-variable theories that escape this 
tightening are those in which choices can simultaneously influence hid- 
den variables in many differently constructed experiments to produce 
a BIV in each one. This strategy is strengthened by using the same bits 
in many experiments, as described above. 

The set of 13 BBT experiments, including true Bell tests and other 
realism tests requiring free choice of measurement, are summarized 
in Table 1 and described in Supplementary Information. Experiments 
(1)-(5), (8) and (11)-(13) used entangled photon pairs, (6) used 
entanglement between single photons and single atoms, (9) used 
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18:00 
——74>—“-—X— 30 November (utc) ————————> <_ 1 December (uc) 


00:00 06:00 12:00 18:00 


D. Jiménez and the CosmoCaixa science museum, re-broadcast live in 
Chinese by L.-F. Yuan and the University of Science and Technology 

of China (USTC). The middle graph shows the number of connected 
laboratories versus time, divided into experiments using only photons 
and experiments with at least one material component (such as atoms or 
superconductors). The bottom graph shows the input bitrate versus time. 
The data flow remains nearly constant despite regional variations, with 
Asian Bellsters handing off to Bellsters from the Americas in the critical 
period 12:00-00:00 utc. Session data from Google Analytics. 


entanglement between single photons and atoms and experiment (7) 
used entangled superconducting qubits. Experiments (7) and (13) used 
high-efficiency detection to avoid the fair-sampling assumption, thus 
closing simultaneously the detection-efficiency loophole and the FOCL. 
Experiment (5) demonstrated a violation of bilocal realism, and (10) 
violated a Bell inequality for multi-mode entanglement. Experiment (1) 
demonstrated quantum steering and (2) investigated temporal quan- 
tum correlations with a three-station measurement. Experiment (12) 
closed the post-selection loophole that is typically present in Bell 
tests based on energy-time entanglement. The analysis of the 
experimental results of (3) sets bounds on how well a measurement- 
dependent local model would have to predict Bellster behaviour to 
produce the observed results*°. Experiments (3), (4), (6) and (13) tested 
whether human-generated measurement choices gave different results 
from machine-generated ones. Most experiments observed statistically 
strong violations of their respective inequalities, supporting the rejec- 
tion of local realism in a multitude of systems and scenarios. 

In summary, on 30 November 2016, a set of 13 Bell tests and sim- 
ilar experiments using photons, single atoms, atomic ensembles and 
superconducting devices, demonstrated strong disagreement with 
local realism, using measurement settings chosen by tens of thousands 
of globally distributed human participants. The results also showed 
empirically that measurement-setting independence—here provided 
by human agency—is in strong disagreement with causal determin- 
ism!7~!?, a topic formerly accessible only by metaphysics. The experi- 
ments reject local realism in a wide variety of physical systems and 
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Table 1 | Experiments carried out as part of the BBT, ordered by longitude, from east to west 


Experiment Lead Institution Location Entangled system Rate (bps) Inequality Result Stat. sig. 
(1) Griffith University Brisbane, Australia Photon polarization 4 Sig < 0.511 Sj6=0.965+0.008 570 
(2) University of Brisbane, Australia Photon polarization 3 S| <2 Sap=2.75+0.05 150 
Queensland & EQUS Spc=2.79+0.05 160 
(3) USTC Shanghai, China Photon polarization 03 PRBLG?° Ib =0.10+0.05 N/A 
(4) 1Q0QI Vienna, Austria Photon polarization 61 x 108 $|<2 Surn= 2.639 + 0.008 8lo 
Sorn = 2.643 +0.006 1160 
(5) Sapienza Rome, Italy Photon polarization 0.62 B<l B=1.225+0.007 320 
(6) LMU Munich, Germany Photon-atom af $| <2 Surn=2.427 + 0.0223 190 
Sorn=2.413 40.0223 18.50 
(7) ETHZ Zurich, Switzerland Transmon qubit 3x 108 s| <2 S=2.3066+0.0012 P<10-99 
(8) INPHYNI Nice, France Photon time bin 2x 103 S| <2 S=2.431+0.003 1400 
(9) ICFO Barcelona, Spain Photon-atom ensemble 25 $| <2 $=2.29+0.10 2.90 
(10) ICFO Barcelona, Spain Photon multi-frequency bin 20 $| <2 $=2.25+0,08 3.lo 
(11) CITEDEF Buenos Aires, Argentina Photon polarization 02 §| <2 $=2.5540.07 780 
(12) UdeC Concepci6n, Chile Photon time bin 5.2 x 104 $| <2 $=2.43+0.02 200 
(13) NIST Boulder, USA Photon polarization 05 K<0O K=(1.6540.20)x 10-4 8.70 
Descriptions of the experiments are given in Supplementary Information. Stat. sig., statistical significance; indicates the number of standard deviations assuming independent and identically 
distributed trials, unless otherwise indicated. Rate indicates the peak rate (in bits per second, bps) at which bits were used by the experiments. Owing to the limited rate of Bellster input, some 
experiments had dead times. B, K, S, Sag, Sac, Suan and Sgrn indicate Bell parameters for the respective experiments and Sj¢ is the steering parameter (see Supplementary Information). /o indicates the 


minimum Putz—Rosset-Barnea-Liang-Gisin measure of setting—choice independence, consistent with the observed BIV. 
USTC, University of Science and Technology of China; EQUS, Centre for Engineered Quantum Systems; IQOQI, Institute for Quantum Optics and Quantum Information; INFYNI, Institut de Physique 
de Nice; ICFO, Institut de Ciencies Fotoniques; LMU, Ludwig-Maximilians-Universitat; ETHZ, ETH Zurich; CITEDEF, Institute of Scientific and Technical Research for Defence; UdeC, University of 


Concepcién; NIST, National Institute of Standards and Technology. 


scenarios, set the groundwork for Bell-test-based applications in quan- 
tum information, introduce gamification to randomness generation 
and demonstrate global networking techniques by which hundreds 
of thousands of individuals can directly participate in experimental 
science. 


Online content 
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METHODS 


Local realism, Bell parameters, Bell inequalities. In their 1935 article*!, Einstein, 
Podolsky and Rosen employed notions of locality (actions or observations in one 
location do not have immediate effects in other locations) and realism (observ- 
ables have values even if we do not observe them) to argue that quantum the- 
ory was incomplete and could, in principle, be supplemented with information 
about which outcomes actually occur in any given run of an experiment. Bell 
formalized these notions by defining local hidden-variable models (LHVMs), a 
class of non-quantum theories that are simultaneously local and realistic. We con- 
sider the simplest case, of two systems measured by two observers, Alice and Bob. 
We write x(y) to represent Alice’s (Bob’s) measurement setting, a(b) to represent 
their measurement outcomes, and X to represent the hidden variable—something 
that we cannot measure, but we include in the model to explain why a and b take 
on particular values. The predictions of any such bipartite LHVM are given by 


P(a, b|x, y) = > P(a|x, A)P(bly, A)P(A) (1) 
Xn 


where P(- | -) indicates a conditional probability. That is, the probability of get- 
ting outcome a and b when Alice and Bob measure x and y, respectively, can be 
expressed in terms of the local conditional probabilities P(a | x, 4) and P(b | y, A). 
A is averaged over, because of our ignorance of the value of this hidden variable. 
If the probabilities P(a | x, \) and P(b | y, A) are restricted to 0 or 1, then we have 
a deterministic LHVM, in which 4, x and y fully determine the outcomes a and 
b. Such LHVMs are explicitly realistic in the Einstein-Podolsky—Rosen sense. 
Locality is also explicit in the model. For example, P(a | x, 4) depends on neither b 
nor y, so that the events at Bob’s station have no influence on Alice's measurement 
outcome a. A mathematical notion of ‘freedom’ is implicit in the LHVM; x and 
y are included as free parameters and not, for example, as functions of X. If 
P(a| x, A) and P(b| y, A) are allowed to take intermediate values, we speak of a non- 
deterministic LHVM. Because the unknown P(A) can take on intermediate values, 
deterministic and non-deterministic LHVMs are equivalent, and from here on we 
do not use this distinction. 

This class of models, which by construction embody the Einstein—Podolsky- 
Rosen assumptions, was shown by Bell to be incapable of reproducing the pre- 
dictions of quantum mechanics. For example, if Alice’s and Bob’s local systems 
are spin-1/2 particles in a singlet state, then their measurements (assumed to be 
ideal) will agree—that is, they will show that they are both spin-up or both spin- 
down—with probability 


Plagree|@, $,) = P(t; 114,.0,) + PUL 11d, ,) = wn ot | 


where ¢p — ¢q is the angle between Alice’s and Bob’s analysis directions. 
Equation (1) cannot reproduce all the features of this distribution. No choice of 
P(a| x, A), P(b| y, A) and P(A) can give a probability P(a, b | x, y) that simulta- 
neously depends on the difference @» — ¢a; has high visibility (ranging from 0 to 
1) and is sinusoidal. 

This difference is efficiently captured by Bell inequalities. A Bell parameter is a 
linear combination of conditional probabilities P(a, b | x, y), and a Bell inequality 
indicates the bounds (within the class of LHVMs) of a Bell parameter. Typically, the 
Bell inequalities of interest are those that are not obeyed by quantum mechanics, 
that is, those for which quantum correlations can be strong enough to violate the 
Bell inequality. Bell’s theorem shows that there are such inequalities and thus that 
quantum mechanics cannot be ‘completed’ by hidden variables. 

Bell inequalities also enable experimental tests of local realism. A Bell test is an 

experiment that makes many spatially separated measurements with varied settings 
to obtain P(a, b | x, y) estimates that appear in a Bell parameter. If the observed 
Bell parameter violates the inequality, one can conclude that the measured systems 
are not governed by any LHVM. We note that this conclusion is always statistical 
and typically takes the form of a hypothesis test, leading to a conclusion of the 
form ‘assuming nature is governed by local realism, the probability to produce the 
observed BIV (or a stronger one) is P(observed or stronger | local realism) < p’ 
This p value is a key indicator of statistical significance in Bell tests. 
‘Freedonr in Bell tests. The use of the term ‘free’ to describe the choices in a Bell 
test derives more from mathematical usage than from its usage in philosophy, 
although the two are clearly related. Bell® (see Methods, John Stewart Bell on ‘free 
variables’) states that his use of ‘free will’ reflects the notion of ‘free variables; that is, 
externally given parameters in physical theories, as opposed to dynamical variables 
that are determined by the mathematical equations of the theory. 

Any realistic LHVM is described by equation (1). The mathematical require- 
ments for the relevant ‘freedom are made evident by a more general description, 
in which the local realistic model includes also x and y and thus specifies the joint 
probability 
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P(a,b, x,y) = Jo Paix, AP(bly, P(x y|)PO) (2) 
r 


Using the Kolmogorov definition of conditional probability, P(A, B) = P(A | B)P(B), 
we find that equation (2) is reduced to equation (1), provided that P(x, y| 4) =P(x, y), 
that is, provided that the settings are statistically independent of the hidden 
variables. According to Bayes’ theorem, this condition can be written as P(A | x, y) = 
P(X) and P(x, y, A) = P(x, y)P(A). This condition is known in the literature as the 
freedom-of-choice assumption, although it implies more than just free choices. 
A more accurate term might be ‘measurement setting—hidden-variable independ- 
ence. We note that this condition does not require x to be independent of y, nor 
does it require P(x, y) to be unbiased. Similar observations emerge from the more 
complex calculations needed to assign p values to observed data in Bell tests’, 

The above describes the sense in which the basis choices should be ‘free. The 
desideratum is independence from the hidden variables that describe the particle 
behaviours, keeping in mind that the choices and measurements could, consistent 
with relativistic causality, be influenced by any event in their backward light-cones. 
Because the setting choices and the measurements will always have overlapping 
backward light-cones, it is impossible to rule out the possibility of a common past 
influence through space-time considerations. If human choices are free, however, 
such influences are excluded. We also note that complete independence is not 
required, although the tolerance for interdependence can be low*”**"*. The theory 
that the entire experiment, including choices and outcomes, is pre-determined 
by initial conditions is known as superdeterminism. Superdeterminism cannot 
be tested*4. 

A very similar concept of ‘freedony applies to the entangled systems measured 
in a Bell test. A BIV with free choice and under strict locality conditions implies 
either indeterminacy of the measurement outcomes or faster-than-light commu- 
nications and thus closed time-like curves'*"®. If Bob’s measurement outcome is 
predictable based on information available to him before the measurement, and 
if it also satisfies the condition for a BIV (namely, a strong correlation with Alice's 
measurement outcome that depends on his measurement choice), then Bob can 
influence the statistics of Alice’s measurement outcome and thus communicate 
with her, despite being space-like separated from her. Considering that Bob could 
in principle have information on any event that occurs in his backward light cone, 
this implies (assuming no closed time-like curves) that Bob’s measurement out- 
come must be statistically independent of all prior events. 

In this way, we see that ‘freedom; understood as a behaviour statistically inde- 
pendent of prior conditions, appears twice in a Bell test; first as a requirement on 
the setting choices and second as a conclusion about the nature of measurement 
outcomes in entangled systems. These two are linked, in that the second can be 
demonstrated if the first is present. 

Previous tests using physical randomness generators to choose measurement 

settings have thus demonstrated a relationship between physical processes, show- 
ing for example*”? that if spontaneous emission is ‘free, then the outcomes of 
measurements on entangled electrons are also ‘free. By using humans to make 
the choices, we translate this to the human realm, showing that, in the words of 
Conway and Kochen”, “if indeed there exist any experimenters with a modicum 
of free will, then elementary particles must have their own share of this valuable 
commodity”. Here, ‘experimenters’ are those who choose the settings, that is, the 
Bellsters. See the main text for a discussion of the locality loophole when using 
humans. 
John Stewart Bell on ‘free variables’. A brief but informative source for Bell’s 
positions on setting choices is an exchange of opinions with Clauser, Horne and 
Shimony in articles titled ‘The theory of local beables’® and ‘Free variables and local 
causality’*®. In the first of these articles, Bell very briefly considers using humans 
to choose the measurement settings 


It has been assumed [in deriving Bell’s theorem] that the settings of instruments are 
in some sense free variables - say at the whim of experimenters - or in any case not 
determined in the overlap of the backward light cones. 


whereas in the second article, Bell defends this choice of method and compares it 
against ‘mechanical’ —that is, physical—methods of choosing the settings 


Suppose that the instruments are set at the whim, not of experimental physicists, but 
of mechanical random number generators. Indeed it seems less impractical to envisage 
experiments of this kind... 


Bell proceeds to consider the strengths and weaknesses of physical random- 
number generators in Bell tests, offering arguments about why under ‘reasonable’ 
assumptions, physical random-number generators might be trusted. Nonetheless, 
he concludes 


Of course it might be that these reasonable ideas about physical randomizers are just 
wrong - for the purpose at hand. A theory might appear in which such conspiracies 
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inevitably occur, and these conspiracies may then seem more digestable than the non- 
localities of other theories. 


In summary, Bell distinguishes different levels of persuasiveness, noting that 
physical setting-generators, while having the required independence in many local 
realistic theories, cannot be expected to do so in all such theories. In contemporary 
terminology, what he argues is that physical-setting generators can only tighten, 
not close, the FOCL. 

Bell also defends his use of the concept of ‘free will’ in a physics context, 
something that had been criticized by Clauser, Horne and Shimony. Bell writes 


Here I would entertain the hypothesis that experimenters have free will [...] it seems to 
me that in this matter I am just pursuing my profession of theoretical physics. 


A respectable class of theories, including contemporary quantum theory as it is 
practiced, have ‘free’ ‘external’ variables in addition to those internal to and conditioned 
by the theory. These variables provide a point of leverage for ‘free willed experimenters, 
if reference to such hypothetical metaphysical entities is permitted. I am inclined to pay 
particular attention to theories of this kind, which seem to me most simply related to 
our everyday way of looking at the world. 


Of course there is an infamous ambiguity here, about just what and where the free 
elements are. The fields of Stern-Gerlach magnets could be treated as external. Or such 
fields and magnets could be included in the quantum mechanical system, with external 
agents acting only on the external knobs and switches. Or the external agents could be 
located in the brain of the experimenter. In the latter case the setting of the experiment 
is not itself a free variable. It is only more or less correlated with one, depending on how 
accurately the experimenter effects his intention. 


It is clear from the last three sentences that Bell considers human intention— 
that is, human free will—to be a ‘free variable in the context of the discussion. 
That is, he believes human intention fulfils the assumptions of Bell’s theorem, as 
do experimental settings faithfully derived from human intention. 

Use of ‘freedom-of-choice loophole’ and ‘locality loophole’ in this work. As 
noted above, a statistical condition used to derive Bell’s theorem is P(x, y, A) = 
P(x, y)P(A), where x and y are choices and A describes the hidden variables. This 
statistical condition, known as the freedom of choice assumption, does not dis- 
tinguish between three possible scenarios of influence: the condition could fail if 
the choices influence the hidden variables, if the hidden variables influence the 
choices or if a third factor influences both the choices and the hidden variables*>°. 
According to Bayes’ theorem, equivalent forms are P(x, y | \) = P(x, y), which 
expresses the fact that knowing does not give information about (x, y), and 
P(A| x,y) = P(A), which expresses the fact that knowing (x, y) does not give infor- 
mation about A. The latter relationship makes clear that influence (in either direc- 
tion) is incompatible with the freedom of choice assumption. The term used for this 
condition should not be taken literally; the condition can be false even if the choices 
are fully free, in the sense of being independent of all prior conditions. This occurs, 
for example, if the choices are made freely but then influence the hidden variable. 

By long tradition, ‘locality loophole is the term given to the possibility of influ- 
ence from Alice's (Bob’s) choices or measurements to Bobs (Alice’s) measurement 
outcomes. The term ‘freedom-of-choice loophole’ was introduced in Scheid] et al.!° 
to describe the influence of hidden variables on choices. The exact definition was 
“the possibility that the settings are not chosen independently from the properties 
of the particle pair”. We note that this formulation centres on the act of choosing 
and its independence, which (assuming relativistic causality, an element of local 
realism) can only be violated by influences from past events, not future events. 
These loophole definitions employ the concept of influence, which is directional, to 
explain how the non-directional relation of independence can be broken. Similarly 
directional definitions have recently been applied to experiments using cosmic 
sources”>4, 

Our use of the term in this paper follows the definition of Scheid] et a 

described above: FOCL refers to the possibility of choices being influenced by any 
combination of hidden variables or other factors within the backward light cone of 
the choice, whereas the possibility of choices influencing hidden variables, which 
necessarily occur in the forward light cone of the choice, is included in the locality 
loophole. Such a division, in addition to fitting the common-sense notion of free 
choice, avoids counting a single possible channel of influence in both FOCL and 
the locality loophole. 
Status of the FOCL. After recent experiments simultaneously closed the locality, 
detection-efficiency, memory, timing and other loopholes*’, the FOCL remains 
open. Space-time considerations can eliminate the possibility of such influence 
from the particles>*"3°’, or from other space-time regions*”, to the choices, but 
not the possibility of a sufficiently early prior influence on both the choices and 
the particles. To motivate freedom of choice in this scenario, well characterized 
physical randomizers”*”! have been used to choose settings. 

In some experiments", the physical assumption is that at least one of (i) sponta- 
neous emission, (ii) thermal fluctuations and (iii) classical chaos” is uninfluenced 
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by prior events and thus unpredictable even within local realistic theories. In other 
experiments”!>378, the physical assumption is that photodetection is similarly 
uninfluenced. While still requiring a physical assumption, and thus not closing the 
FOCL, this strategy tightens the loophole in various ways. First, by using space-like 
separation to rule out influence from certain events (for example, entangled pair 
creation) and from defined space-time regions. Second, by using well characterized 
randomness sources, for which the setting choice is known to faithfully derive 
from a given physical process, it avoids assumptions about the predictability of 
side-channel processes. Third, in the case of refs. °°, by using a physical variable 
that can be randomized by each of several processes, the required assumption 
is reduced from ‘x is uninfluenced’ to ‘at least one of x, y and z is uninfluenced. 
Prediction engine. Generation of random sequences by humans has been stu- 
died in the field of psychology for decades”””. Early studies showed that humans 
perform poorly when asked to produce a random sequence, choosing in a biased 
manner and deviating from a uniform distribution. In ref. “° it was shown that 
humans playing competitive, zero-sum games that reward uniform random 
choices tend to produce sequences with fewer identifiable biases. One such game 
is matching pennies: players must simultaneously choose between heads or tails; 
one player wins if the results are equal and the other player wins if the results are 
different. This is a standard two-person game used in game theory"! (see also 
ref. “°) with a mixed-strategy Nash equilibrium: as both players try to outguess each 
other, by behaving randomly, they do not incentivize the other player to change 
their strategy. 

The BIG Bell Quest reproduces the coin-matching game, with an MLA playing 
the part of the opponent. The MLA operates on simple principles that human 
players could employ; it maintains a model of the tendencies of the opponent, 
noting, for example, that ‘after choosing 0 and 0, the opponent usually chooses 1 
as the next bit. The MLA strategy operates with very little memory, mirroring the 
limited short-term memory of humans. 

Formally, we write x; € {0, 1} for the ith input bit, S, = {x), ..., xx} for a sequence 
of k input bits, and x = {Xj Xj4 9-9 Xj41—1} for a length-L sub-sequence of S; 
starting from bit j. Given S; as input, the algorithm predicts the value of x,,.1 € {0, 1} 
that maximizes 


(L) 
max x, x, 
Pas fi! ke AlXk—-L4 1) (3) 


where f estimates the probability of x following x in S, 


Hx let D ig Er) =x\\x, 1<i<k-L} 
Hx xM =x, 1<i<k—L} 


fF, (x|x) = (4) 


where || indicates concatenation and #A indicates the number of elements in set 
A. Equations (3) and (4) mean that the prediction algorithm identifies the most 
frequently input sequence, of length Lmax+ 1 or shorter, that the player can form 
when adding the bit x,.;, and predicts that the player will produce the bit needed 
to complete that sequence. Lmax is chosen to be 3, reflecting the limited memory 
of a human opponent in the coin-matching game. 

In equation (4), the estimator f(x | x) of the probability that x follows x in S; is 
based on modelling the user’s input as a Markov process*?. The MLA keeps a run- 
ning estimate T;(z | y), updated with each new input bit, of the matrix describing 
the probabilities of transitions among length-L words, from word y to word z. The 
estimates are simply the observed frequency of transitions in S;,. The MLA then 
obtains f; as a marginal probability distribution: the probability of the first bit of z 
being x, conditioned on the tail of S; being y (See Extended Data Fig. 1). 
Networking strategy and architecture. The BBT required reliable, robust and 
scalable operation of two linked networking tasks: providing the BIG Bell Quest 
video game experience, as well as live aggregation and streaming of user input to 
the running experiments. From a networking perspective, the latter task resembles 
an instant messaging service, with the important asymmetry that messages from a 
large pool of senders (the Bellsters) are directed to a much smaller pool of recipients 
(the laboratories). The network architecture, shown in Fig. 1c, was implemented 
using Amazon Web Services IaaS (Infrastructure as a Service) products. 

In the messaging component, we employed a two-layered architecture, shown 
in Fig. 1c. In the first layer, BBT nodes received input bits from the users and per- 
formed a real-time health check (described below) to block spamming by robot 
participants. The data were then sent to the second layer, a single-instance hub 
node that concatenated all the bits from the first stage and distributed them to the 
laboratories. The communication between the two layers was implemented using 
a memcache computation node to maximize speed and to simplify the synchro- 
nization between the two layers. 

The gaming task was handled by a single layer of game nodes and a database. 
To protect the critical messaging task from possible attacks on the gaming compo- 
nents, we used separate instances to handle back-end gaming tasks, such as user 
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information and rankings, and to handle back-end tasks in the messaging chain, 
such as data logging. Load balancers, networking devices that distribute incoming 
traffic to a scalable pool of servers, were used in both the gaming and messaging 
front ends to avoid overloading. This design pattern is known as horizontal scaling, 
and is a common practice in scalable cloud systems. 

This specific architecture was not available as a standard service from web service 

providers, but was readily constructed from standard component services. The 
architecture is not specific to the low-bitrate manual input collected for the BBT 
and could easily be adapted to other data that can be collected by personal devices, 
for example, audio or acceleration. The architecture was designed to solve a prob- 
lem specific to time-limited projects with crowd-sourced input: owing to the single- 
day nature of the BBT, the unknown number and geography of the participants 
and the possibility of hackers or spammers, it was not practical to test the system 
under full-load conditions before the event itself. The two-layer architecture helped 
maintain all the critical servers isolated and independently operating and helped 
us to scale up the system smoothly when traffic increased. In the event, the traffic 
surpassed our initial estimates, and we deployed three additional BBT nodes at 
09:00 utc (when Europe was waking up) with no interruption of service. Such 
scaling up is expected to be critical for projects (for example, ref. “*) that combine 
laboratory experimentation, which tends to be time-limited because of stability 
and resource considerations, with crowd-sourcing, which usually entails unknown 
and fluctuating demand. 
BBT nodes. The first layer of computing resources received data from Bellsters—or, 
more precisely, from The BIG Bell Quest running in browsers on their computers 
and devices. A variable number of servers running the same software functionali- 
ties were placed behind a pre-warmed load balancer that was prepared to support 
up to 10,000 simultaneous connections. Users connected to the load balancer via 
a public URL end-point and sent the data from their browsers using websocket 
connections. This first layer of servers aggregated the data from each connection 
(that is, from each user) in independent buffers during a 0.5-s interval. 

A simple but important ‘health check’ was performed to identify and block 

high-speed robotic participants. Ifa given user contributed more than 10 bits in a 
single 0.5-s interval, corresponding to a rate of more than 20 keypresses per second, 
the user account was flagged as being non-human, and all subsequent input from 
that user was removed from the data stream. No feedback was provided to the 
users if their account was flagged, to avoid leaking information about the blocking 
mechanism. This method could potentially ban honest users because of networking 
delays and other timing anomalies, but was necessary to prevent the greater risk of 
the data stream being flooded with robotic input. 
Hub node. The hub node aggregated the data from all the BBT nodes and also 
handled the connection to the laboratories. In contrast to the BBT nodes, which 
had to service connections from an unknown and rapidly changing number of 
users, the hub node aggregated data from a small and relatively stable number of 
trusted instances. Overall, the two-layer design simplified the networking task 
of delivering input from a large and variable number of users to end points (the 
laboratories) receiving aggregated data streams at variable rates. 

Laboratories connected to the hub requested to receive random bits from the 

Bellsters, which were distributed after aggregating four of the 0.5-s batches from 
the BBT nodes, that is, in intervals of 2s. At the end of each interval, bits were sent 
to each running experiment. If an experiment had requested N bits, it was sent bits 
{xq, X1,...> Xn_1}, that is, the earliest bits to arrive in that interval. Thus, the same bits 
were used simultaneously in many experiments. This helped to tighten the locality 
loophole, because an influence from the input bits on the measurements would 
have to operate in the same way in several independent experiments and in several 
locations. With the exception of experiments (12) and (13), the sent bits were used 
within the next 2-s interval. To run faster than the Bellster input rate, experiments 
(12) and (13) operated in burst mode, accumulating bits for a specific time and then 
rapidly using them. As with the BBT instances, these connections were established 
using websocket connections. When connecting to the hub node, the laboratories 
specified their bitrate requirement, which could be dynamically changed. The hub 
node then sent a stream of Bellster-generated bits at the requested rate. Archived 
bits from BBT participation prior to 30 November were cached locally at the labo- 
ratories, so that experiments could continue to run even if the flow of real-time bits 
was insufficient or interrupted. During the event, the flux of live bits was sufficient 
and no experiments used these pre-distributed bits. 
Memcache node. The interface between the BBT nodes and the hub instance was 
implemented using a memcache node. While adding an extra computing resource 
slightly increased the complexity of the architecture, it added robustness and 
simplified operations. The memcache node, in contrast to the BBT and hub nodes, 
had no internet-facing functionality, making its operation less dependent on 
external conditions. For this reason, both the BBT nodes and the hub node were 
registered and maintained on the memcache node, allowing the restart of any of 
these internet-facing instances without loss of records or synchronization. 
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In addition, as detailed in the next section, there was an additional monitor node 

in charge of (i) recording all the random bits sent from the Bellsters to the labo- 
ratories and (ii) providing real-time feedback to the Bellsters. This functionality 
was isolated from the operations of the hub node. Again, by splitting the moni- 
tor and hub instances, a failure or attack in the public and non-critical real-time 
feedback functionality had no effect on the main, private and critical random-bit 
distribution task. 
Monitor node. For analysis and auditing purposes, all of the bits passing through the 
first layer of servers were recorded in a database, together with metadata describing 
their origin (monitor computing resource in Fig. 1c). In particular, every bit was 
stored together with the username that created it and the origin timestamp. The 
random bitstreams sent to the individual laboratories were similarly recorded bit 
by bit, allowing a full reconstruction of the input to the experiments. 

In post-event studies of the input data, we estimated the possible contribu- 
tion from potentially machine-generated participations that were not blocked by 
the real-time blocking mechanism. We analysed participants whose contribution 
were substantial, more than 2 kbit in total, and looked for anomalous timing 
behaviours—such as an improbably short time spent between missions and 
improbably large number of bits introduced per mission, both of which are limited 
by the dynamics of human reactions when playing the game. By flagging parti- 
cipants that contributed such anomalous participations as suspicious and cross- 
referencing against the bits sent to the experiments, we found that no experiment 
received more than 0.1% of the bits from the 11 suspicious participants. 

In addition to using the monitor computing resource to store all the information 

that was streamed to the laboratories in a database, we also implemented a real- 
time feedback mechanism to improve the Bellsters’ participation experience. After 
accomplishing each mission, users were shown a report on the use of their input at 
each of the laboratories running at that moment, as illustrated in Extended Data 
Fig. 2d. The numbers shown were calculated as a binomial random process B(n, pj) 
with parameters n= N and p;= R;/R, where N is the number of bits introduced by 
a user in his/her last mission, R; is the number of bits sent to laboratory i, and R is 
the total number of bits entered in the last 0.5-s interval. 
Gamification. The BBT required a large number of human-generated random 
bits in a short time, thus requiring many participants, rapid input and sustained 
participation. The gamification strategy was designed to maximize all of these 
factors. Extended Data Fig. 2 shows screenshots of the game. 

While still adhering to common conventions of video games (such as levels, 
power-ups, boss battles, animations and sound effects), the intended appeal of 
The BIG Bell Quest is less its entertainment value than the opportunity to contri- 
bute to the BBT experiments and to test one’s unpredictability against a computer 
opponent. The game design incorporated internationalization, connection to social 
networking, community-building features and a feedback system to inform users 
about their contribution to the experiment, all considered essential to attract the 
necessary tens of thousands of participants. 

The game has a classic challenge-and-reward incentive structure. The chal- 
lenge is to produce random bits while avoiding being predicted by the Oracle (see 
Extended Data Fig. 2a). This reproduces the ‘penny-matching’ game studied in 
psychology*!” and resembles the well known ‘rock—paper-scissors’ game, thus 
requiring little explanation. The Oracle is an MLA that predicts player behaviour 
based on patterns in past input (described in Methods, Prediction engine). Most 
player time was spent in a rapid ‘speed game’ (see Extended Data Fig. 2b), where 
the Bellster moves along a road by hitting 0s and 1s. This part of the game requires 
rapid bit generation (a few bits per second) to complete the level in time. Every 
20 bits an indicator shows the player's ‘unpredictability —that is, the percentage 
of unpredicted bits entered thus far—and the final score reflects the number of 
unpredicted bits, with a power-up multiplier for bits entered during a particular 
time window. 

The rewards are multiple: at the individual level, the player is given a score for 
each level (to encourage a high fraction of unguessed input) and a cumulative 
score (to encourage repeated play). At the community level, a sharing platform 
offers rankings and a tool to create groups, so that Bellsters can compare their 
performance among friends and colleagues. Players can also post their scores to 
social networks (Facebook, Twitter or Weibo) at the press of a button (see Extended 
Data Fig. 2e and f). At the scientific level, the game provides a report on which 
laboratories have used how many of a player’s input and for what purpose (see 
Extended Data Fig. 2c). Finally, the player is occasionally rewarded with a short 
video pre-recorded at one of the laboratories, in which experimentalists explain a 
part of their experiment. User feedback (see Extended Data Fig. 2e) suggests that 
this approach succeeded in making Bellsters feel meaningfully involved in the 
project, with a positive effect on retention and propagation. 

The speed game-Oracle structure was repeated through three levels of 
difficulty, or ‘worlds’ (‘Users, ‘Internet’ and ‘Laboratory’), illustrating the travel of 
the bits from the fingers of the Bellsters, through the internet, to the laboratories 
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(see Extended Data Fig. 2c). The times, speeds and required unpredicted-input frac- 
tion in each of the levels were adjusted with the help of beta testers to avoid offering 
levels of trivial or impossible difficulty. The final Oracle level was objectively 
difficult even for an experienced player. To pass, it required n > 20 unguessed bits 
in a time period that allowed at most 30 bits to be entered. Even for a sequence of 
30 ideal random input bits, the condition n > 20 occurs less than 5% of the time 
according to binomial statistics. For 30 bits that are predictable with probability of 
0.6, the chance of success drops below 0.003. Nevertheless, several players persisted 
and completed the game (see Extended Data Fig. 2f). 

Data availability. Experimental data are available upon reasonable request from 
the contact author of each experiment, as indicated in the author contributions. 
Other project data are available upon reasonable request from the corresponding 
author. 
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Extended Data Fig. 1 | Markov chains. a, Markov chain for L = 1. States 
are represented by circles of different colours, and transitions between 
states by arrows coloured as the initial state. The last input bit determines 
the state of the predictor. The probability p(a | b) of a transition from state 
b to state a is estimated from the sequence S,. b, Markov chain for L=2, 
with four states determined by the last two input bits. The transitions 
model the probability p(a, b | c, d) that the user will input bits ab, given 
that the last two bits were cd. The final prediction is based on the marginal 
of the next single bit, which is extracted from these estimated probabilities. 
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Extended Data Fig. 2 | Screenshots from The BIG Bell Quest, on the use of the user’s input bits in the running experiments. Blue and 
illustrating various game elements. a, The Oracle uses an MLA to predict _ red buttons allow instant sharing on social networks Twitter and Weibo, 
user input. b, The ‘running’ component of the game, in which participants _ respectively. e, A social media post sharing participant results. f, A social 
are asked to enter a minimum number of bits with a minimum media post by a participant who completed the very difficult last Oracle 
unpredicted-input fraction in a limited time. c, Sequence of increasing- level. The BIG Bell Quest artwork by Maria Pascual (Kaitos Games). See 
difficulty levels, interspersed with Oracle challenges. d, In-game feedback also Methods, Gamification. 
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Highly durable, coking and sulfur tolerant, 
fuel-flexible protonic ceramic fuel cells 


Chuancheng Duan!*, Robert J. Kee!, Huayang Zhu!, Canan Karakaya!, Yachao Chen!, Sandrine Ricote!, Angelique Jarry”, 
Ethan J. Crumlin’, David Hook’, Robert Braun!, Neal P. Sullivan! & Ryan O’Hayre!* 


Protonic ceramic fuel cells, like their higher-temperature solid- 
oxide fuel cell counterparts, can directly use both hydrogen and 
hydrocarbon fuels to produce electricity at potentially more 
than 50 per cent efficiency. Most previous direct-hydrocarbon 
fuel cell research has focused on solid-oxide fuel cells based 
on oxygen-ion-conducting electrolytes, but carbon deposition 
(coking) and sulfur poisoning typically occur when such fuel cells 
are directly operated on hydrocarbon- and/or sulfur-containing 
fuels, resulting in severe performance degradation over time*®. 
Despite studies suggesting good performance and anti-coking 
resistance in hydrocarbon-fuelled protonic ceramic fuel cells””*, 
there have been no systematic studies of long-term durability. Here 
we present results from long-term testing of protonic ceramic 
fuel cells using a total of 11 different fuels (hydrogen, methane, 
domestic natural gas (with and without hydrogen sulfide), propane, 
n-butane, i-butane, iso-octane, methanol, ethanol and ammonia) 
at temperatures between 500 and 600 degrees Celsius. Several 
cells have been tested for over 6,000 hours, and we demonstrate 
excellent performance and exceptional durability (less than 1.5 
per cent degradation per 1,000 hours in most cases) across all fuels 
without any modifications in the cell composition or architecture. 
Large fluctuations in temperature are tolerated, and coking is not 
observed even after thousands of hours of continuous operation. 
Finally, sulfur, a notorious poison for both low-temperature and 
high-temperature fuel cells, does not seem to affect the performance 
of protonic ceramic fuel cells when supplied at levels consistent 
with commercial fuels. The fuel flexibility and long-term durability 
demonstrated by the protonic ceramic fuel cell devices highlight 
the promise of this technology and its potential for commercial 
application. 

Without any modifications in composition or architecture of the pro- 
tonic ceramic fuel cells (PCFCs), we achieve excellent performance 
with direct operation (that is, no fuel pre-processing, pre-reforming or 
conditioning) on 11 different fuel streams (Fig. 1a), many of which have 
not previously been studied in a PCFC. Our PCFC operating directly 
from the flame tip of a butane lighter (Supplementary Video 1) further 
highlights the fuel flexibility and thermal shock resistance of these cells. 
Peak power densities at 600°C (Fig. 1a, b) range from 0.66 W cm ~ for 
H) to 0.17 Wcm * for iso-octane (data at lower temperatures provided 
in Extended Data Fig. 1). Direct PCFC operation on NH; and CH;0H 
fuels approaches the performance of pure Hp, suggesting that these 
fuels could be particularly attractive for commercial PCFC applica- 
tions. Performance on simulated domestic natural gas (a mixture of 
95 vol% CHyg, 3 vol% CzH¢ and 1 vol% C3Hs, balanced with N2) exceeds 
the performance on pure methane and is insensitive to the presence 
of 19.5 p.p.m. H2S impurity, slightly higher than the maximum total 
sulfur concentration of 17 p.p.m. permitted for US domestic natural 
gas supply. Moreover, PCFC performance is not severely degraded even 
after 1,000h of continuous operation on H2S-contaminated natural 
gas (Fig. 1f). 


Performance on the linear hydrocarbon fuels (alkanes) follows the 
trend CH4 < C3Hg < C4H jo: that is, performance increases with increas- 
ing hydrocarbon chain length. In contrast, performance on alcohol 
fuels follows the inverse trend CH;0H > C,H;5OH. The temperature 
required for propane and butane steam reforming is lower than methane 
steam reforming because the C-H bond strength decreases with 
increasing hydrocarbon chain length (for example, from 435 kJ mol! 
for CH, to 412 kJ mol™! for C3Hg to 409kJ mol™! for C4Hj9). In fact, 
reforming literature suggests that propane and butane conversion can 
approach 100% at 450°C in the presence of appropriate catalysts”. The 
enhanced low-temperature reforming rates of propane and butane 
therefore contribute considerably to the enhanced PCFC perfor- 
mance on these fuels relative to methane. Gas chromatography analysis 
(Extended Data Fig. 2a, b) of the PCFC anode outlet stream under 
propane and butane operation, respectively, shows that H2 yield from 
C4H) po is higher than that from C3Hg at 600°C, matching the PCFC 
performance trends. Considering the alcohol fuels, the presence of a 
C-C bond in ethanol (which methanol lacks) greatly hinders reforming 
and can lead to unfavourable, coke-prone intermediates such as ethane 
and ethylene’®. These factors probably contribute to the superior PCFC 
performance observed for methanol relative to ethanol and are con- 
sistent with previous observations in solid-oxide fuels cells (SOFCs)!". 

In most cases, we conducted PCFC testing at inlet steam-to-carbon 
ratios, S:C (or oxygen-to-carbon ratios, O:C), just outside the graphitic 
carbon deposition region based on equilibrium thermodynamic cal- 
culations’? (Fig. 1c). To aid comparison, we tested all fuels at similar 
O:C ratios, targeting O:C values of about 2.5:1 which are within the 
range that can be realistically met using commercially feasible SOFC 
balance-of-system designs’. For the higher hydrocarbon fuels (pro- 
pane, butane and iso-octane), the target O:C ratio was difficult to 
achieve using steam alone; in these cases, we also supplied some Op 
to the fuel stream to meet the target O:C ratios. Previous analyses 
have shown that thermally sustaining operation of PCFCs with direct 
internal steam-methane reforming at S:C ratios of 2.5:1 is feasible’. 
Irreversible heat generation from the cell is more than sufficient to offset 
the endotherm from in situ steam-methane reforming. In fact, direct 
internal steam-reforming, as conducted in this work, has additional 
(and substantial) process intensification benefits because the close 
physical co-location of the endothermic steam reforming reactions 
and exothermic electrochemical processes aid in rapid and efficient 
heat transfer. 

Figure 1c illustrates a key difference between SOFC and PCFC anode 
mass balance when operating on hydrocarbon fuels!»'®. During PCFC 
operation, hydrogen is removed from the anode via protons transiting 
from anode to cathode. In contrast, during SOFC operation, oxygen is 
introduced into the anode by oxygen ions transiting from the cathode 
to the anode. Thus, as the fuel is used, PCFC and SOFC anode mass 
balances follow different trajectories even when starting from identical 
compositions of the inlet fuel stream. Even though the PCFC and SOFC 
starting compositions are the same, the ending PCFC and SOFC anode 
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Fig. 1 | PCFC performance across a range of fuels. a, Current density 
versus voltage (j-V) and current density versus power density (j-P) curves 
of PCFCs under hydrogen (cell 1, fuel F1), ammonia (cell 2, F2), methanol 
(cell 3, F3), iso-butane (cell 4, F4), n-butane (cell 5, F5), simulated 
desulfurized natural gas (cell 7, F6), simulated natural gas with 19.5 p.p.m. 
H.S (cell 7, F7), propane (cell 4, F8), methane (steam/carbon ratio S:C = 2, 
cell 7, F9), methane (S:C = 2.5, cell 8, F11), ethanol (cell 10, F10) and iso- 
octane (cell 9, 0.1 Acm™~? at 600°C at 0.82 V, 0.2 Acm~? at 600°C at 0.73 V, 
0.3 Acm “ at 600°C at 0.62 V, F12) at 600°C. b, Peak power densities 

of PCFCs on 12 different fuel streams at 600 °C (methane is shown for 

two different O:C ratios). Lifetime and degradation rate of PCFCs on 
selected fuels also indicated. NG, natural gas. c, Ternary diagram shows 
regions of equilibrium carbon formation and full oxidation, with dots for 
experimental fuel compositions. Fuel composition trajectories for propane 
(red, 20 vol% C3Hs + 20 vol% O + 60 vol% H,O) and methane (blue, 33.3 
vol% CH4+ 66.7 vol% H2O) are shown as solid lines for PCFC (indicating 
removing H) and dashed lines for SOFC (adding O). The end-point 
compositions correspond to complete oxidation of the fuel stream to CO, 
and H,O. Thus, these mass-balance trajectories essentially correspond to 
the progress of the anode reaction (that is, the extent of fuel utilization). 
‘Deposits’ indicates that carbon deposition (coking) is thermodynamically 
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favourable in that region. d, OCV for propane (red) and methane (blue) 
fuel streams as functions of H removal for PCFC at 500°C, and of O 
removal for SOFC at 500 and 800°C. e, Long-term stability of direct 
i-butane/propane (cell 4; i-butane fuel stream composition is 10 vol% 
C4Hj9 + 20 vol% O2 + 70 vol% H2O; propane fuel stream composition 

is 20 vol% C3Hg + 20 vol% O2+ 60 vol% H2O; current density for both 
fuels is 0.2 Acm~7) and methanol (cell 3, fuel stream composition is 50 
vol% methanol + 50 vol% HO, current density j =0.4Acm~*) PCFCs 

at 500°C and iso-octane PCFC (cell 9, j; =0.1 Acm~? at 600°C at 0.79 V, 
jo=0.2 Acm at 600°C at 0.73 V, j; =0.3 Acm at 600°C at 0.62 V) at 
600°C (green dots are the linear fitting results of the degradation trend; 
overall degradation rate is calculated based on its slope). f, Long-term 
stability of direct-natural gas PCFC with 19.5 p.p.m. H2S (cell 7) at 500°C 
with an output current density of 0.25 Acm * and long-term stability of 
direct-ethanol PCFC (cell 11) at 550°C with a constant current density of 
0.2 Acm ? (voltage fluctuation is due to the unstable fuel supply and high 
steam concentration). g, Nyquist plots measured at OCV of a PCFC 

(cell 7) under sulfur-free simulated natural gas, simulated natural gas with 
19.5 p.p.m. H2S, and methane at 600°C. Zim is the imaginary component 
and Z,, the real component of impedance. 
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compositions are different because HO is produced in the SOFC anode 
(versus the cathode for PCFC). The dilution of the anode fuel stream 
in the SOFC leads to a decrease in the Nernst potential relative to the 
PCFC as a function of the extent of reaction (Fig. 1d). Thus, the PCFC 
can maintain higher operating voltage than a comparable SOFC, par- 
ticularly at high fuel utilization’. 

Although the lack of anode H2O production can potentially enhance 
PCFC performance relative to the SOFC, it also yields anode gas com- 
positions that are more prone to coke formation. As shown in Fig. Ic, 
the compositions of PCFC anode gas reside within, or skirt just outside, 
the thermodynamic coke formation boundary across the full extent 
of reaction. In contrast, SOFC anode gas compositions quickly evolve 
away from the coke formation boundary with increasing extent of reac- 
tion. Despite the strong possibility for coking under PCFC operating 
conditions as indicated in Fig. 1c, we did not observe signs of coking 
during long-term PCFC operation on any of the tested fuels, and car- 
bon deposits were not observed in post-mortem analyses of the cells 
(Extended Data Fig. 3a-f). The triple conducting oxide cathode and the 
electrolyte/cathode interfacial microstructures were likewise unaffected 
after long-term operation (Extended Data Fig. 4a, b). Several cells were 
continuously tested for more than 6,000h without any sign of coking 
(Fig. le), including one cell operated for 2,800 h on i-butane followed 
by an additional 3,200h on propane (Fig. le, primary y-axis) anda 
second cell operated continuously for about 8,000 h on methanol (Fig. le, 
secondary y-axis). In both cases, the long-term degradation rates were 
<1.5% per 1,000 h. Additional cells were tested on iso-octane (chosen 
as an analogue for petrol (gasoline), Fig. le secondary y-axis), natu- 
ral gas (with 19.5 p.p.m. H,S, Fig. 1f, secondary y-axis), and ethanol 
(Fig. 1f, primary y-axis) for >1,000h each, again without observable 
signs of coking and with long-term degradation rates around 4% per 
1,000h, 10% per 1,000h and 1.5% per 1,000h, respectively. In addi- 
tion, we compared the coking resistance of our PCFC anode, based on 
nickel/yttrium-doped barium zirconate (Ni/BZY) to a conventional 
SOFC anode based on nickel/yttria-stabilized zirconia (Ni/YSZ), by 
in situ high-temperature Raman spectroscopy (in situ HT-Raman). 
These studies reveal that the intrinsic coking resistance of the Ni/BZY- 
based PCFC anode is higher than that of a Ni/YSZ-based SOFC anode 
and provide insights into the mechanisms underlying this remarkable 
coke-resistance (Extended Data Fig. 5). 

To aid direct comparison of PCFC performance on methane, desul- 
furized natural gas and natural gas with sulfur impurities (19.5 p.p.m. 
H,S), we operated a single PCFC on all three fuel streams consecutively. 
As reported in Fig. 1a, b and Extended Data Fig. 1, PCFC performance 
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on desulfurized and H2S-containing natural gas slightly exceeds per- 
formance on pure methane. We attribute this difference to the small 
concentration of more-easily reformed higher hydrocarbon species (for 
example, C,H, and C3Hs) in real natural gas. Figure 1g provides electro- 
chemical impedance spectroscopy Nyquist plots of this cell on all three 
fuels at open-circuit voltage (OCV), demonstrating that the ohmic 
resistance remains constant and the HS has no obvious effect on the 
electrode polarization resistance. This is in contrast to the typical behav- 
iour in SOFCs, in which both the ohmic and electrode-polarization 
resistances generally increase in H2S-contaminated fuel streams”. 
The long-term, coke-free, sulfur-tolerant, direct-fuelled fuel cell per- 
formance summarized in Fig. 1 is perhaps without precedent. SOFCs, 
which typically operate at temperatures above 700°C, generally suffer 
from coking when operated directly on hydrocarbon fuels, even at O:C 
ratios higher than 2:1. Furthermore, standard consensus holds that 
coking should be exacerbated at lower temperatures, and the anode 
mass balance trajectories shown in Fig. 1c suggests that PCFCs should 
have greater propensity to coke than SOFCs under the same operating 
conditions. Nevertheless, the PCFCs examined here achieve long-term 
coke-free operation at temperatures considerably lower than their 
SOFC counterparts. Previous stability data from PCFCs and SOFCs 
operated on hydrocarbon/alcohol fuels reported in the literature 
(Extended Data Table 1) show that the PCFCs that we describe here 
deliver exceptional lifetime and durability. Multiple factors contribute 
to this behaviour, including several unique features of PCFCs that have 
been previously noted in the literature and several that have not. 
First, a propitious in situ process of nanocatalyst formation occurs 
inside the PCFC anode during operation that probably contributes 
to durability and anti-coking resistance. In many cases, fuel cell per- 
formance increases slightly during the first several hundred hours of 
operation (for example, Fig. le), coinciding with the in situ forma- 
tion of nanoscale nickel catalyst particles in the PCFC anode during 
the initial stages of operation under reducing conditions. Figure 2a—c 
demonstrates the time-dependent evolution of the exsolved Ni nano- 
particle phase starting from a fresh anode (Fig. 2a), after operation on 
humidified methane for 300h at 600°C (Fig. 2b), and after operation 
on humidified methane for 1,400h at 500°C (Fig. 2c). This exsolution 
process leads to the formation of a relatively dense and uniform coat- 
ing of Ni nanoparticles (about 5-100 nm in diameter) across the BZY 
electrolyte phase. Nanoparticle density increases slightly with time 
(compare Fig. 2b with Fig. 2c), and some agglomeration is seen, 
although the particles are still well dispersed and most particles remain 
well below 100 nm in size (Extended Data Fig. 6a, b). Transmission 


Fig. 2 | In situ exsolution of Ni nanoparticles 
during PCFC operation. a, Scanning electron 
micrograph (SEM) of BZY20 (BaZro.sYo.2.03-s) 
PCFC anode as sintered. b, SEM of BZY20 
PCFC anode after 300h operation on 
humidified methane (28.5 vol% CH4+71.5 
vol% HO) at OCV at 600°C. c, SEM of 
BZY20 PCFC anode after 1,400 h operation on 
humidified methane (20 vol% CH, + 50 vol% 
H,O + 30 vol% Ar) under an output current 
density of 80 mA cm ~ at 500°C. d, TEM 
micrograph of a lift-out sample from the BZY20 
PCFC anode (Fig. 2c) prepared by focused ion 
beam (FIB). The red box is the particle on the 
grain boundary to be analysed in Fig. 2e, f. 

e, High-angle annular dark field (HAADF) 
image of a select area with one particle on the 
grain boundary (inset, one HAADF image 
with lower magnification to show the location 
of particle). f, Overlay of Ba, Zr, Y, Niand O 
energy-dispersive X-ray spectroscopy (EDS) on 
the HAADF micrograph and bright field (BF) 
confirming that the nanoparticle is Ni. Panels 
e, f were acquired from the reverse side of the 
sample relative to d. 
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electron microscopy (TEM) investigations confirm that the nano- 
particles are Ni (Fig. 2d, e, f). Additional evidence in support of the 
in situ Ni exsolution process is provided in Extended Data Figs. 7, 8. 
Irvine and co-workers have recently demonstrated similar exsolution 
of Ni nanoparticles in SrTiO3-based perovskite catalyst supports and 


Water-gas shift 


observed greatly enhanced coking resistance for the exsolved Ni catalyst 
relative to comparable SrTiO3 supports with deposited Ni nanoparticle 
catalysts’®, 

As schematically illustrated in Fig. 3a, b and further discussed in 
the Extended Data, the unique anode microstructure resulting from 


Carbon cleaning mechanism 


(1) H,O = OH(BZY) + H(BZY) 
(2) C,q(Ni) + OH(BZY) = COH(Ni) + BZY 
(3) COH(Ni) + O(Ni) = COOH(Ni) + Ni 

(4) COOH(Ni) + OH(BZY) = CO, + HO 
(5) C,q(Ni) + H,O(BZY) = CO, + H, 

0) CO, + BZY = CO,(BZY) 

(7) CO,(BZY) + BZY = CO(BZY) + O(BZY) 


i 
ZN 
6) CO,(BZY) + O(BZY)= 0 oO 
BZY BZY 
Q) CO,(BZY) + OH(BZY) = CO,H(BZY) + BZY 


(10)  CO(BZY) + OH(BZY) = COOH(BZY) + BZY 
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Fig. 3 | Carbon and sulfur mitigation mechanism. a, Schematic 
illustration of PCFC and mechanism of hydrocarbon reforming, water- 
gas shift reaction, sulfur cleaning and carbon cleaning. b, Mechanism of 
carbon cleaning. Caq indicates carbon absorbed on the surface of Ni. ¢, In 
situ HT-Raman spectra of a carbon-contaminated Ni/BZY-based anode 
after exposure to dry 1 vol% Hp (balance Ar) and humidified 1 vol% 

Hz (10 vol% H20, balance Ar) at 500°C. d, In situ HT-Raman spectra 

of a carbon-contaminated Ni/BZY-based anode after exposure to 50 
vol% dry CO: + 49.5 vol% Ar+0.5 vol% Hy (dotted arrow indicates the 
timeline of experimental progression). e, In situ HT-Raman spectra of a 
carbon-contaminated Ni/BZY-based anode after continuous exposure 
to humidified 1 vol% Hy (8 vol% HO) at 400°C for 60 min; and in situ 
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Raman shift (A cm-') 


Time (h) 


HT-Raman spectra of a carbon-contaminated Ni/YSZ-based anode after 
continuous exposure to humidified 1 vol% Hy» (8 vol% HzO) at 400°C for 
70 min, then humidified 1 vol% H, (13 vol% HO) at 400°C for 40 min, 
then humidified 1 vol% H (13 vol% H2O) at 450°C for 20 min and finally 
humidified 1 vol% H) (13 vol% HO) at 500°C for an additional 40 min. 
f, In situ HT-Raman spectra of a carbon-contaminated Ni/YSZ-based 
anode after exposure to 50 vol% dry CO2+ 49.5 vol% Ar+0.5 vol% 

H, for various intervals up to 20h followed by exposure to humidified 
CO; (40 vol% CO, + 10 vol% H,0 + 49.5 vol% Ar+ 0.5 vol% H2) for 
various intervals up to an additional 28h at 500°C. g, Mass spectrometer 
SO, signal intensity as a function of time for PCFC anode during direct 
internal reforming of H2S-contaminated natural gas. 
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this Ni exsolution process facilitates rapid reforming chemistry with 
anti-fouling properties, thanks to the intimate communication between 
the Ni nano-islands and the supporting BZY electrolyte phase. Several 
researchers have observed that BZY can contribute to HO dissociation 
to form surface hydroxyl (OH,,)) and H(.) compounds that can serve 
as surface-intermediate species in carbon removal and water-gas shift 
(WGS) reactions. Water dissociation can occur without a barrier on the 
BZY support!’. Thus, HO adsorption becomes the limiting step of the 
OH, formation reaction, and H,O uptake by the BZY support proba- 
bly determines the carbon-removing reaction rate. At PCFC operating 
temperatures (500-600 °C), BZY is known to have a high H,O-uptake 
capacity”®. As OH,,) species are formed on the BZY, they can react with 
the adsorbed carbon on the Ni, suppressing coke formation’. Nanoscale 
Ni particles well dispersed on the BZY support provide high surface area 
and close interaction of the surface adsorbates (either between two Ni 
surfaces, or between Ni and BZY) by decreasing the distance for spill- 
over (migration of adsorbates from the Ni onto the BZY and vice versa). 
Depending on the reaction temperature and the H2O concentration 
in the feed stream, reaction between the surface carbon C(Ni) and the 
surface hydroxyl OH(BZY) may follow different catalytic paths and form 
various surface intermediates such as formyl (COH), carboxyl (COOH) 
or formate (HCOO)?3, which then further react to CO and CO,”*. 
The overall effect of this process seems to be stable catalytic behaviour 
even for long time-on-stream operations as observed in the Fig. 1 data. 

Key in situ Raman evidence for the active role of the BZY phase 
in removing carbon is provided in Fig. 3c, d. A carbon-contaminated 
Ni/BZY anode was loaded into the in situ HT-Raman chamber under 
dry 1% Hz; and heated to 500°C (Fig. 3c). Exposure to dry 1% H, does 
not decrease the intensity of the disordered and graphitic carbon 
Raman signals, indicating that carbon cannot be removed under dry 
reducing atmospheres. However, after switching to a wet reducing 
atmosphere, the carbon signals are eliminated within 60 min. These 
results strongly suggest that water plays an important role in the elimi- 
nation of carbon. In addition, a Raman peak associated with -CO3 
transiently appears (and then disappears) during this carbon cleaning 
process in wet hydrogen, suggesting that the introduction of HO leads 
to the formation of surface -CO3 groups that are also involved in the 
carbon removal process. 

To further verify the above mechanism, we carried out additional in 
situ HT-Raman spectroscopy studies (Fig. 3d). Upon exposure to dry 
CO, we again observe the elimination of carbon in concert with the 
formation of -CO3 species. In this case, the -COs persists because, with- 
out H,0, surface hydroxyls are not available for further reaction. These 
results demonstrate that carbon can be removed through the formation 
of -CO3, whereas subsequent removal of the -CO3 requires -OH/H,0. 

We pursued analogous experiments on a Ni/YSZ anode as summa- 
rized in Fig. 3e, f. In marked contrast to behaviour of Ni/BZY, hydra- 
tion, hydroxylation and -CO; formation are not detected in the Ni/YSZ 
anode, and carbon removal does not occur upon exposure to humid- 
ified hydrogen, even when resorting to higher temperatures and/or 
higher levels of humidification. Furthermore, the carbon cannot be 
removed in dry or wet CO . Investigations of carbon-contaminated 
Ni/BZY and Ni/YSZ (Extended Data Fig. 9a, b) by in operando ambient- 
pressure X-ray photoelectron spectroscopy also show that Ni/BZY 
exhibits enhanced carbon removal compared with Ni/YSZ. 

The BZY support also enhances catalyst stability under sulfur- 
containing hydrocarbon gas. Although sulfur is found in natural gas 
in various forms (thiols, COS, CS; and H2S), under reforming con- 
ditions it mostly converts to H2S, which has a well-known poisoning 
effect on Ni surfaces. Even H2S concentrations as low as 1-5 p.p.m. 
can decrease catalytic activity by selectively adsorbing on Ni**, and 
sulfur-poisoned Ni has no activity below 700°C”®. The sulfur uptake 
of the catalyst is related to the Ni surface area. Thus, Ni nanoparticles 
with high dispersion and high catalytic surface area can tolerate greater 
sulfur poisoning. Typically, sulfur can be removed through reaction 
with surface oxygen species to form SO3. Over a Ni/BZY catalyst, sul- 
fur is preferentially adsorbed on the Ni rather than the BZY support’. 
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Fig. 4 | Thermal cycling measurement of hydrogen-fuelled PCFC 
cycling between 260°C and 550°C. a, Amplified temperature profile 
and cooling/heating rate. Heating and cooling rates reach as high as 
28°C min™!. b, Output current was recorded at a constant voltage of 
0.85 V. Based on the last cycle, the cell required 18 min to fully recover 
from 260°C to its original temperature and performance at 550°C. 


Thus, the BZY support is readily available for H2O uptake and hydroxyl 
formation. We speculate that this OH(BZY) can help to remove the 
adsorbed sulfur from the Ni surface. As indirect evidence of this reac- 
tion, we detect SO, production during the reforming of H2S-containing 
natural gas in our PCFC anode (Fig. 3g). This is consistent with 
recent studies by Yang et al.’, who observed high sulfur tolerance for 
Ni/BaZr9.1Ceo,7Y02-xY b,O3_s anode materials (although their electrolyte 
composition was somewhat different). A series of plausible elementary 
reaction steps (see Methods section) can be proposed for sulfur clean- 
ing from the Ni surface via the bi-functionality of the Ni/BZY system. 

In addition to coking resistance and sulfur tolerance, thermal cycling 
stability is crucially important for fuel cell commercialization, particu- 
larly for applications requiring start/stop capability, or for transient or 
variable loads. We subjected a representative cell to 32 rapid thermal cycles 
while operating at a constant potential of 0.85 V (Fig. 4a, b). The tem- 
perature and current density profiles for the last thermal cycles (Fig. 4b) 
show that the cell power output dropped to zero as the cell tempera- 
ture fell to about 260°C. The cell required 18 min to fully recover its 
original temperature and performance at 550°C. Between each thermal 
cycle and upon completion of the full 32 thermal cycles, the cell recov- 
ered to >99.5% of its initial current density, suggesting highly stable 
intrinsic thermal-cycling capability of the PCFC cell materials. The 
exceptional fuel flexibility and long-term durability demonstrated by 
the Ni/BZY-based PCFC devices reported here, combined with their 
thermal cycling stability, highlight the promise of this technology and 
its potential for commercial application. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0082-6. 
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METHODS 
PCEC fabrication. PCFCs were fabricated by the previously developed solid-state 


reactive sintering (SSRS) method. The precursor powders for the anode and elec- 
trolyte were prepared by simple mixing and drying processes. For example, the 
anode precursor powder of 40 wt% BZY20 + 60 wt% NiO was prepared by mixing 
stoichiometric amounts of BaCO3 (99.8% Alfa Aesar), ZrO (99.7% Alfa Aesar), 
Y 03 (99.9% Alfa Aesar), and NiO (Ni78.5 wt%, Alfa Aesar) with 20 wt% starch 
pore former (based on the sum of BZY20 and NiO). This raw precursor mixture 
was ball-milled in isopropanol with 3-mm beads of YSZ for 48h, followed by dry- 
ing at 90°C for 24h. Similarly, the electrolyte precursor powder of BZY20+ 1.0 
wt% NiO was prepared by mixing appropriate amounts of BaCO3, ZrO) Y203 
according to the desired BZY20 stoichiometry, with the addition of 1.0 wt% NiO as 
a sintering aid. This raw precursor mixture was blended using the same ball-milling 
and drying procedures used for the anode precursor. 

In this work, the previously developed triple conducting oxide, 
BaCoo 4Fep.4Z10.1Y0,103—5 (BCFZY0.1)’, was used for the cathode electrode. 
BCFZY0.1 precursor cathode powders were synthesized by the improved sol-gel 
method. In brief, appropriate stoichiometric amounts of Ba(NO3)2 (Alfa Aesar), 
Co(NO3)2:H2O (Alfa Aesar), Fe(NO3)3-9H2O (Alfa Aesar), ZrO(NO3)2 35 wt% 
in dilute nitric acid (Sigma Aldrich), Y(NO3)3-6H2O (Alfa Aesar) and EDTA 
(Alfa Aesar) were dissolved in 28-30% ammonium hydroxide (Alfa Aesar) under 
continuous heating and stirring. Citric acid was then introduced with the molar 
ratio of EDTA acid: citric acid: total metal ions controlled to be around 1.5:1.5:1. 
Subsequently, NH3H2O or HNO; was used to adjust the pH value to around 9, 
and the solution became immediately clear. Upon evaporating the water, a dark 
purple gel was obtained. The gel was then put into a drying oven at 150°C for 24h 
to get the primary powder, which was then ball-milled with 1-butanol as solvent 
for 48h. The powder was dried at 90°C for 24h and then calcined at 600°C for 
5h followed by ball-milling with isopropanol as solvent for 24h and dried at 90°C 
for 24h to obtain the final powder for cathode paste preparation. We prepared 
the cathode paste by mixing 4g of the respective powders with 1.5 g BCZYYb 
(BaCeo 7Zr0.1Y0,1Ybo103—5) + 1 wt% NiO powder prepared by SSRS and calcined 
at 1,400°C for 18h, 1 g dispersant (20 wt% solsperse 28000 (Lubrizol) dissolved 
in 3 terpineol), and 0.3 g binder (5 wt% V-006 (Heraeus) dissolved in terpineol). 

Precursor pastes of the electrolyte were prepared by mixing 30 g of the respec- 
tive powders with 6 g dispersant (20 wt% solsperse 28000 (Lubrizol) dissolved in 
3 terpineol), and 2 g binder (5 wt% V-006 (Heraeus) dissolved in terpineol). The 
anode precursor powder was dry-pressed under 375 MPa for 2 min in a circular 
carbon-aided steel die set with a diameter of 19mm to produce 2-mm-thick green 
anode pellets. A thin electrolyte precursor paste layer (about 10-20 1m after firing) 
was deposited on each side of the green anode pellets by screen-printing. (The 
electrolyte was applied to both sides of the cell to minimize asymmetric stress 
issues during sintering.) This ‘half-cell’ anode/electrolyte structure was then fired 
at 1,450°C for 18h. After firing, the extra (stress-compensating) electrolyte layer 
was removed by grinding. The cathode (about 10-20 1m thick) was printed on the 
electrolyte by brush/screen printer and the cell was fired at 900°C for 5h, resulting 
in a complete single cell (porous anode | dense electrolyte | porous cathode) with a 
final sintered diameter of about 15 mm and effective active area of 0.5cm?. 
PCEC testing. These membrane-electrode assemblies were sealed onto an alu- 
mina tube with glass powder (the sealing temperature was 900°C) and reduced 
under H) for 24h at 600°C. I-V polarization tests of the PCFC button cells were 
performed with 300 ml min“! dry air and 15ml min“! dry hydrogen as oxidant and 
fuel respectively by a Gamry Reference 3000 over a range of temperatures from 
350°C to 600°C. The electrolyte is a mixed protonic and oxygen-ion conductor”. 
We note that even the small (for example, 2-3%) partial oxygen ion conduction 
expected under these operating conditions is enough to provide 3-4% steam 
content on the anode side of the PCFC owing to electrochemical reaction of the 
hydrogen with the oxygen ions. Thus, even though ‘dry’ hydrogen is supplied to 
the anode, sufficient hydration of the anode chamber is sustained. For the thermal 
cycling stability testing, the clamshell testing furnace was turned off and manually 
opened to the ambient in order to subject the cell to a rapid cooling rate. Using 
this approach, we achieved cooling rates as high as 28 °C min~!. For all other fuels, 
testing was performed over a range of temperatures from 500°C to 600°C (or 
350-600°C where noted) with 300 ml min~! (STP) air as oxidant and the following 
fuel stream conditions: 

(1) Iso-butane: 10 vol% iso-C4H 19 + 20 vol% O24. 70 vol% H20 (flow rate of 
iso-C4Hjo: 0.9ml min~!). 

(2) Natural gas: 33.3 vol% natural gas (5 ml min™!) + 66.7 vol% H,0. The sim- 
ulated natural gas was provided by Praxair Technology Inc. and had 95 vol% CHg, 
3 vol% CaH,g, 1 vol% C3Hg (balanced with N2) with or without 19.5 p.p.m. H2S. 

(3) n-butane: 10 vol% n-C4H 319 +20 vol% O2+ 70 vol% HO (flow rate of 
n-C4Hjo: 0.9 ml min— 1, 

(4) Iso-octane: UHP nitrogen at a flow rate of 5 ml min“! was flowed through a 
bubbler containing iso-octane with constant temperature (approximately 21°C). 
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The mixture of iso-octane and nitrogen (about 5.5 vol% iso-octane + 94.5 vol% 
nitrogen) was flowed through a second bubbler containing heated water (85°C) 
giving a final inlet fuel composition of 2.3 vol% i-CgHj + 39.7 vol% N2 +58 vol% 
H,0 (0.29 ml min“! i-CsHj), This produced a H,O:CgH|¢ ratio of 25:1 which is 
somewhat higher than the minimum S:C ratio of 16:1 calculated based on the 
stoichiometric requirements for the complete steam reforming of iso-octane. 

(5) Propane: 20 vol% C3H g +20 vol% O2+ 60 vol% H2O (flow rate of C3Hs: 
1.6ml min“ '). Testing was conducted from 500°C to 600°C. 

(6) Methane: 28.5 vol% CHy (5ml min~!) +. 71.5 vol% HO (S:C =2.5) or 33.3 
vol% CH, (5mlmin~'!) + 66.7 vol% HO (S:C = 2). Testing was conducted from 
500°C to 600°C. 

(7) Methanol: Fuel stream introduced as an evaporated mixture of methanol 
and water (molar ratio of water:methanol of 1:1, flow rate of mixed liquid is 
0.024ml min ~'). Testing was conducted from 350°C to 600°C. 

(8) Ethanol: Fuel stream introduced as an evaporated mixture of ethanol 
and water (molar ratio of water:methanol of 7:1, flow rate of mixed liquid is 
0.005 ml min“). 

(9) Ammonia: 10 ml min~! NH3. Testing conducted only at 600°C. 

SOFC and PCFC open-circuit potential calculations. The open-circuit potentials 
calculated in Fig. 1d are based on an air flow stream with a composition of 20 vol% 
O», 77 vol% N2 and 3 vol% H20. The fuel flow stream takes the equilibrium com- 
position based on the given C, H and O fractions at the investigated temperature 
and pressure. However, Hz is considered to be the only electrochemically active 
fuel species in the fuel stream. 

Evaluation of the open circuit potential for PCFCs is not straightforward. 
Although the yttrium-doped barium zirconates used in most PCFCs are dom- 
inantly proton conductors, they are in fact mixed ionic-electronic conductors. 
Among other attributes of such electrolyte membranes, the OCV (that is, Nernst 
potential or equilibrium reversible potential) cannot be evaluated simply in terms 
of gas-phase composition differences across the electrolyte membrane. 

The electronic contributions to conductivity typically take the form of small 
polarons. The electronic current associated with polaron mobility is often referred 
to as a ‘leakage current, which serves to reduce the OCV relative to a membrane 
that is a pure ion conductor. The measurable OCV can be evaluated, but in addi- 
tion to the gas-phase compositions, it depends in a more complex way upon the 
charge-defect mobilities within the membrane'®”**°. The reduced OCVs reported 
here do account for the mixed-conduction and leakage current. 

By way of contrast, considering H) as the fuel, the OCVs of fuel cells based 
on YSZ membranes and pure protonic ceramic membranes (pure PCFC) can be 
evaluated as 


() 


O,a 


In these expressions, p represents the gas-phase partial pressure and subscripts 
‘a and ‘c’ refer to anode and cathode, respectively, and F is Faraday’s constant 
(96,485 C mol). Even with the same temperature and gas-phase composition 
across the electrolyte membrane, the OCV may be different for pure proton and 
oxygen-ion conducting membranes. Depending on the particular operating con- 
ditions, OCVs for the present BZY electrolyte membranes are typically a few tens 
of millivolts lower than would be the case for a pure proton-conducting membrane. 
High-temperature X-ray diffraction. High-temperature X-ray diffraction 
(XRD) analyses were performed using a Bruker D8 Discover TWIN/TWIN 
Diffractometer with a vertical goniometer (radius is 280 mm), Cu Ka radiation, 
tube voltage of 40kV and tube current of 40 mA. The hot stage is an Anton Paar 
HTK 2000N and utilizes a 0.5-mm Pt strip stage. Intensities were collected in the 
2 theta range between 15 and 90 with a step size of 0.02 and a measuring time of 
0.5s at each step. The heating ramp rate was 5°C min ', with a 10-min hold at each 
temperature before the scan was started. 
Structure and morphology characterization. X-ray diffraction (XRD) analyses 
of all powders and pellets were performed at room temperature using a Philips 
X Pert Pro MPD diffractometer (PANalytical, Almelo, Netherlands) with Cu Ka 
radiation, tube voltage 45 kV and tube current 40 mA. Intensities were collected 
in the 2 theta range between 10° and 120° with a step size of 0.008° and a measur- 
ing time of 5s at each step. The microstructure and chemical composition of the 
sintered pellets were investigated by means of Field Emission Scanning Electron 
Microscopy (FE-SEM, JEOL JSM7000F). TEM samples were prepared by FIB (FEI 
Helios Nanolab 600i). TEM and energy-dispersive spectroscopy (EDS) were per- 
formed with a FEI Talos F200X TEM/STEM. 
In-situ high-temperature Raman spectroscopy measurements and discussion. 
Anode pellets of Ni/BaZro sg Y.203-s (Ni/BZY) were prepared by SSRS method using 
the same procedure as for the complete PCFCs. Ni/YSZ-based anode power was 
prepared by mixing NiO (Alfa Aesar, 60 wt%) with YSZ (Fuelcellmaterials.com) 
and 20 wt% starch pore former (based on the sum of YSZ and NiO). Anode pellets 
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of Ni/YSZ were prepared in the same geometry and manner as the Ni/BZY based 
anodes. After sintering at 1,450°C for 18h, both Ni/BZY and Ni/YSZ-based anode 
pellets were polished to 0.5mm and reduced at 600°C for 150h under dry hydro- 
gen. Some reduced pellets were quenched to room temperature under hydrogen 
for coking resistance measurements. To study the details of the carbon cleaning 
process, some reduced Ni/BZY anode and Ni/YSZ anode pellets were treated for 
4h under dry 99.9% methane at 500°C to purposely trigger carbon deposition 
(coking). 

A Harrick environmental chamber was used for the in situ Raman tests. 
Temperature was controlled by the temperature controller of the chamber. Gas 
compositions and flow rates were controlled by mass flow controllers. Total gas flow 
rate for all measurements was 50 ml min~!. The gas composition of dry methane 
for in situ Raman tests is 5% CH4 + 95% Ar. Steam concentration was controlled 
by a heated bubbler and measured by a humidity sensor. 

The Raman spectroscopy measurements were conducted in situ with the 
Harrick environmental chamber using an Alpha 300R Confocal Raman micro- 
scope (WiTec Instruments Corp., Knoxville, TN) with a doubled Nd:YAG green 
laser operating at 532 nm (maximum power: 39 mW). All experiments were per- 
formed at constant laser power. An Olympus SLMPLN 20 x objective (Olympus, 
Center Valley, PA) with a working distance of 25mm was used for all the meas- 
urements. 

In contrast to the PCFCs presented in this work, direct hydrocarbon operation 
in SOFCs often results in coking even when operating with S:C ratios at or above 
the thermodynamically predicted carbon deposition boundary. In Extended Data 
Fig. 5, we compare the coking resistance of our Ni/BZY-based PCFC anode with 
that of a conventional Ni/YSZ-based SOFC anode by in situ high-temperature 
Raman spectroscopy. As shown in Extended Data Fig. 5a, carbon is detected on 
the Ni/YSZ-based anode after just 20 min exposure to humidified methane (S:C 
ratio = 2) at 500°C. In comparison, carbon is not detected on the Ni/BZY-based 
PCFC anode after 360 min exposure to methane under the same conditions, nor 
even upon long-term (>4,000 min) exposure to humidified methane at $:C = 1 
(Extended Data Fig. 5c). Instead of coking, lattice protons (OH,*) and surface 
carbonate groups (-CO3) are formed on the Ni/BZY anode under these conditions, 
both of which we hypothesize play a role in coke and sulfur mitigation. We attribute 
these differences in surface chemistry to the basic nature of BZY versus the acidic 
nature of YSZ. We hypothesize that a series of surface reaction processes (Fig. 3a, b 
main manuscript) are responsible for the remarkable coking resistance observed 
in the Ni/BZY anode. 

Because BZY is highly basic, surface hydration and hydroxylation readily 

occur, thereby increasing the O:C ratio on the surface. In humidified atmospheres, 
absorbed carbon on the Ni or BZY surface will first react with hydroxyl groups 
on the BZY surface, producing CO; (Fig. 3b, reactions (1)-(4)). The CO, product 
can subsequently absorb on the BZY surface to form -CO3 groups, which can 
subsequently react with additional -OH groups in a ‘cleaning loop. Reaction steps 
involving the -OH reaction intermediate are kinetically fast, and thus the steady- 
state surface coverage of -OH remains low. 
Mechanism of sulfur and carbon cleaning. As discussed in the main paper, 
there are important surface-chemistry differences between PCFCs and SOFCs 
that probably contribute to the unique coke and sulfur-resistant properties 
of PCFCs. PCFC electrolytes such as BZY and yttrium-doped barium cerate 
(BCY) are highly basic*!. In comparison, common SOFC electrolytes such as 
YSZ and gadolinium-doped ceria (GDC) are considerably more acidic. Because 
the electrolyte phase is present as a considerable volume fraction (20-40%) in 
standard Ni-based cermet (ceramic—metal composite) fuel cell anode architec- 
tures for both PCFCs and SOFCs, the resulting surface chemistry differences 
can greatly affect anode behaviour. Specifically, several recent SOFC studies”?! 
have demonstrated that the hydration of BZY/BCY can increase the O:C ratio 
on surface and at two phase boundaries with Ni which can aid carbon removal 
and increase the coking resistance. The presence of proton conducting ceramic 
particles in the anode can also inhibit the adsorption of sulfur and help with 
sulfur removal!” 

Density functional theory studies suggest that on Ni (111) and Ni(211) surfaces, 
COOH(Ni) is most likely to form among the three carbon-containing hydroxyl 
species. Furthermore, a previous study”? suggests that the rate-determining step 
for the WGS reaction is the formation of COOH(Ni). The coke removal rate is 
found to be directly proportional to the COOH(Ni) formation rate as well. The 
plausible surface reaction steps in which COOH(Ni) is formed and consumed are 
highlighted below. The adsorbed species on the Ni are designated as X(Ni) whereas 
surface adsorbates on the BZY support are designated as X(BZY). 

As the reaction pathways suggest, the bi-functionality of Ni/BZY occurs via 
reaction of Ni and BZY surface adsorbates. Although the mechanism is not shown 
in detail, the interaction of the two surface adsorbates can be explained by spill- 
over density arguments*”. The physical distance between the two adsorbates can 
determine the reaction rates. 


The following elementary reaction steps are proposed for sulfur cleaning from 

Ni via surface hydroxyl species. 

(1) H)S + Ni=S(Ni) + Hp 

(2) S(Ni) + OH(BZY) = SOH(Ni) + BZY 

(3) H.O + Ni= H,O(Ni) 

(4) H,O(Ni) = OH(Ni) + H(Ni) 

(5) OH(Ni) + Ni= O(Ni) + H(Ni) 

(6) SOH(Ni) + O(Ni) = SO2(Ni) + H(Ni) 

High-temperature ambient pressure X-ray photoelectron spectroscopy meas- 
urements. Pellets of BZY10 (BaZro 9Y0,103_5) and 8YSZ (Zro,.92Y0.0g02~5) with pat- 
terned Ni electrodes were prepared for the ambient pressure X-ray photoelectron 
spectroscopy (AP-XPS) measurements. 

The BZY10 substrates were fabricated by SSRS. BaCO3, ZrO, and Y2O3 were 
mixed in stoichiometric proportions with 1 wt% NiO. The mixture of powders was 
pelletized and sintered 10h at 1,550°C. 

8YSZ (from Tosoh) was pressed and the specimens were also sintered for 10h at 
1,550°C. After sintering, the pellets were polished down to 0.8 mm thickness, and 
Ni comb-shaped electrodes were sputtered on one side of the pellets. 

High-temperature AP-XPS measurements were performed at the beamline 9.3.2 
at Lawrence Berkeley National Laboratory's (LBNL) Advanced Light Source (ALS). 
The pellets, one at a time, were set on a boron nitrate heater, with a thermocouple 
placed directly onto the sample surface. The samples were heated to 450°C under 
ultra-high vacuum (UHV) to clean the surface and remove adventitious carbon. 
In the case of the 8YSZ pellet, the sample was then cooled down to 300°C and 
100 mTorr of steam was introduced in the main chamber. During the cleaning 
process, low-resolution survey spectra from 650 eV to —10 eV (binding energies) 
were collected continuously with a beam photon energy of 710 eV. The binding 
energies for the collected spectra were then subsequently calibrated to the Zr 3d 
photoemission peak at 181.2 eV. 

Here, the UHV cleaning behaviour of carbon-contaminated Ni/BZY (Extended 
Data Fig. 9a) and Ni/YSZ (Extended Data Fig. 9b) patterned microelectrode 
structures are compared. Both Ni/BZY and Ni/YSZ samples were subjected to 
in situ UHV treatment at 450°C while AP-XPS survey spectra were continu- 
ously recorded*?*4, Within 25 min, the adventitious carbon contamination was 
completely removed from the Ni/BZY sample as shown by the disappearance of 
the Cls peak at 285 eV binding energy (Extended Data Fig. 9a). In contrast, the 
Ni/YSZ sample still showed noticeable carbon contamination even after 80 min 
UHV heat treatment (Extended Data Fig. 9b). To promote the cleaning process, the 
Ni/YSZ sample was therefore subsequently exposed to steam (Py20 = 100 mTorr) 
at 300°C for an additional 150 min. Even after this treatment, however, the Cls 
signal was not fully eliminated, thereby underscoring the difficulty in removing 
carbon from a Ni/YSZ SOFC anode electrode relative to a Ni/BZY PCFC anode 
electrode. A more extensive surface-analysis study of Ni/BZY versus Ni/YSZ under 
hydrocarbon fuel streams is planned. 

Sulfur mitigation experiment. We loaded 4 g anode powder, after calcination at 
1,450°C for 18h, into a packed bed reactor. The powder was reduced under pure 
hydrogen at 600°C for 100h. After reduction, the gas composition was switched 
to 33 vol% H2 +67 vol% HO and the SO; mass spectrometer signal intensity 
(64amu) was recorded with a MKS Cirrus 2 mass spectrometer. After about 18 min, 
the gas composition was switched to 33 vol% simulated natural gas with 19.5 p.p.m. 
H2S + 67 vol% H3O and the SO, mass spectrometer signal intensity (64amu) was 
continuously recorded. The H and natural gas flow rate were held constant at 
5mlmin“!. 

Data availability. The data that support the finding of this study are available from 
the authors on reasonable request. See author contributions for specific data sets. 
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Extended Data Fig. 1 | Performance of PCFCs under 10 fuels. a, j-Vand —_j-P curves of PCFC under 33.3 vol% natural gas (simulated natural gas 


j-P curves of PCFC under hydrogen (flow rate of hydrogen: 15 ml min“, with 19.5 p.p.m. H)S, flow rate of natural gas: 5 ml min“, cell 7) + 66.7 

cell 1). b, j-V and j-P curves of direct-methanol PCFC at 600-350°C. vol% H,O. g, j-V and j-P curves of PCFC under 33.3 vol% natural gas 
Flow rate of the mixture of methanol and water (molar ratio of water/ (simulated natural gas without H,S, flow rate of natural gas: 5 ml min™!, 
ethanol is 1, cell 3) is 0.024ml min!. c, j-V and j-P curves of PCFC under _ cell 7) + 66.7 vol% HO. h, j-V and j-P curves of PCFC under 33.3 vol% 

10 vol% iso-C4H 9 + 20 vol% O2+ 70 vol% H20 (flow rate of iso-C4H jo: methane (flow rate of methane: 5 ml min“, cell 7) + 66.7 vol% H,0. i, j-V 
0.9 ml min’, cell 4). d, j-V and j-P curves of PCFC under 10 vol% n- and j-P curves of PCFC under 28.5 vol% methane (flow rate of methane: 
C4Hj9 + 20 vol% O72 +70 vol% H;O (flow rate of n-C4Hjo: 0.9 ml min“, 5mlmin“|, cell 8) +71.5 vol% HO. j, j-V and j-P curves of direct-ethanol 
cell 5). e, j-V and j-P curves of PCFC under 20 vol% C3Hg + 20 vol% PCFC at 600°C and 550°C. Flow rate of the mixture of ethanol and water 
O2 + 60 vol% HO (flow rate of C3H¢: 1.6 ml min“|, cell 4). f, j-V and (molar ratio of water/ethanol is 7, cell 10) is 0.005 ml min~!. 
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Extended Data Fig. 2 | Anode-side exhaust gas composition of PCFCs on propane and iso-butane. a, Direct-propane PCFC. b, Direct-iso-butane 
PCEC. 
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Extended Data Fig. 3 | SEM images of PCFC (Cell 4) after running for 
about 6,000 h on hydrocarbons at 500°C. a, Anode (low magnification). 
b, Anode (high magnification). c, The sandwich structure of this cell 
after running for about 6,000h. d, The interface between electrolyte 

and cathode. e, High magnification of cathode after 6,000 h operation. 

f, Raman spectrum of the PCFC anode after 6,000 h of operation on 
hydrocarbon fuel at 500°C. A carbonate peak is visible at 1,060 cm7! 

but the graphitic carbon (G band) and disordered carbon (D band), 
which would be present at 1,580cm7! and 1,350cm7|, are not apparent, 
indicating the absence of graphitic carbon and disordered carbon deposits 
in the anode even after long-term operation on hydrocarbon fuel. Long- 
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term stabilities of direct hydrocarbons PCFCs were tested at 500°C. At this 
temperature, the structure type of carbon species should be polymeric, 
amorphous films or filaments (Cg), vermicular filaments, fibres and/or 
whiskers (C,), or graphitic platelets or films (C,)*°. Typically, these 
carbon species are visible (by SEM) in SOFC anodes after running on 
hydrocarbon fuels, but the high-magnification SEM images of our PCFC 
anode (Extended Data Fig. 3a, b) show no visible evidence of such carbon 
structures. To further substantiate this conclusion, post-mortem Raman 
analyses show that there are no disordered and graphitic carbon species 
found in the PCFC anode even after long-term operation on hydrocarbon 
fuels (Extended Data Fig. 3f). 
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Extended Data Fig. 4 | TEM image and EDS mapping of PCFC (Cell 3) maps of Ba, Zr, Y, Co, Fe and O. There is no chemical reaction between 
after 8,000 h operation on methanol. a, TEM image of lamella from the electrolyte and electrolyte which indicates the excellent chemical 
BZY20 PCFC electrolyte/cathode interface prepared by FIB.b-h, HAADF compatibility of cathode with electrolyte. EDS mapping of cathode shows 
image of the electrolyte/cathode interface, and corresponding EDS there is no obvious elemental segregation. 
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Extended Data Fig. 5 | Coking resistance of Ni/BZY and Ni/YSZ anodes 
investigated by in situ HT-Raman spectroscopy. a, Raman spectra of 
Ni/YSZ anode exposed to humidified methane (S:C = 2; 500°C) at various 
times up to 200 min. b, Raman spectra of Ni/BZY anode exposed to 
humidified methane (S:C = 2; 500 °C) at various times up to 360 min. 


Raman shift (acm’) 


c, Raman spectra of Ni/BZY anode exposed to humidified methane 

(S:C = 1; 500°C) at various times up to 72 h (4,320 min). d, Raman spectra 
of Ni/YSZ anode exposed to humidified methane (S:C ~ 1; 550°C) at 
various times up to 40 min. e, Raman spectra of Ni/BZY anode exposed to 
humidified methane (S:C ~ 1; 550°C) at various times up to 100 min. 
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Extended Data Fig. 6 | SEM image of the anode of a PCFC after running 
for 1,400 and 6,000 h on hydrocarbons at 500°C. a, b, Operation for 
6,000 h on iso-butane and propane (cell 4). c, Operation for 1,400h 

on methane. d, Operation for 6,000 h on iso-butane and propane (cell 

4). The Tamman temperature of nickel is 581°C. The target operating 
temperature of our PCFCs is lower than 600°C, and most of the long-term 
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stability testing conducted in this study was at 500°C (that is, well below 
the Tamman temperature), at which Ni sintering/coarsening is avoided. 
Indeed, as shown in Extended Data Fig. 6c, d, the distribution and size of 
Ni nanoparticles on the BZY anode support remain essentially constant 
between 1,400 and 6,000 h of operation. 
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Extended Data Fig. 7 | The mechanism of SSRS and exsolution of Ni 
nanoparticles. a, Exposure to reducing conditions (for example, H2 or 
hydrocarbon fuel) at typical PCFC operating temperatures triggers the 

in situ exsolution of Ni nanoparticles onto the surface of the BZY phase 

in the cermet anode. This exsolution process is driven by the decreased 
solubility of Ni in the BZY phase under anode operating conditions 
(500-600 °C, highly reducing) compared with the much higher starting 
solubility of Ni in the BZY phase under anode synthesis conditions 

(1,450 °C, highly oxidizing). In addition, fuel cell operation also drives the 
decomposition of a residual BaY,NiO; phase which is a by-product of the 


solid-state reactive-sintering method (SSRS) used to synthesize the anode. 


As illustrated here, the SSRS process enables the anode to be rapidly and 
inexpensively fabricated in a single step starting from a homogeneous 
mixture of BaCOs3, ZrO2, Y203, NiO and pore former, which is fired 
under oxidizing conditions at 1,450°C for 18h. During firing, a complex 
phase-formation and sintering process, involving the transient formation 
and decomposition of BaY,NiOs, leads to the creation of a porous two- 
phase NiO + Ni-doped BZY composite anode with a small amount of 
dispersed residual BaY,NiOs. During initial fuel cell operation, the NiO 
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phase is reduced to Ni metal, while the Ni-doped BZY is reduced to BZY°®, 
concomitant with the exsolution of Ni nanoparticles which we hypothesize 
enhance the performance and durability of the fuel cell. b, Amplified 

main peaks of XRD patterns of BZY20 phase after sintering in air with 
different NiO amount in precursors. The amount of Ni diffusing into the 
BZY is determined by the defect reaction equilibrium. The amplified main 
XRD peaks of BZY20 phase shift to the higher angles with increasing 
amounts of Ni in the anode. This is consistent with a decreasing lattice 
constant with increasing Ni substitution on the B-site of the BZY lattice 
owing to the smaller size of Ni?* compared with Zr** or Y*+. c, XRD 
patterns of BZY20 anode after sintering in air and reduction in hydrogen 
at 600°C for 150h. d, Amplified main peaks of XRD patterns of BZY20 
phase after sintering in air and reduction in hydrogen at 600°C for 150h. 
The peak shifts to smaller angles after reduction which is consistent with 
the exsolution of Ni from BZY lattice, leading to a concomitant increase 

in the lattice constant of the BZY phase. e, BaY,NiOs phase formation 
temperature profile investigated by in situ high-temperature XRD. The 
colour indicates the intensity of the main peak of BaY,NiOs. 
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Extended Data Fig. 8 | SEM images of the Ni nanoparticle exsolution for 50h. Nickel nanoparticles begin to form on the triple junction points 
process in the PCFC anode. a, b, BZY20 phase of the anode before and grain boundaries. e, f, BZY20 phase of the anode after reduction 
reduction. The surface of the BZY20 phase is very clean, and there are no under hydrogen at 600°C for 100h. More Ni nanoparticles begin to form 
nanoparticles along the grain boundaries or on the triple junction points. along grain surfaces and boundaries. The size of exsolved Ni nanoparticles 


c, d, BZY20 phase of the anode after reduction under hydrogen at 600°C is less than 100 nm. 
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Extended Data Fig. 9 | Evolution of the AP-XPS elemental survey spectrum of Ni/BaZro9Y0,103-5 (BZY) and Ni/Zro.92Y0.0g02—5 (8YSZ) patterned 
microelectrodes. a, Ni/BZY-based microelectrode, b, Ni/YSZ-based microelectrode. 
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Double-layered perovskite High operatin 
(Pro.aSro.s)3(Feo.asMoo.15)207 (DLP- 750 °C Ethane ~1,5%/100 hours H gh operaling - 
iemperatures 
PSFM) 
: A 6 _ 6, High operating 49 
Copper—iron—ceria-YSZ 800 °C Methane 21.5%/46 hours temperatures 
reer ° No degradation after 80 High operating 50 
Lao.3Sro.eCeo.1Nio.1Ti0.203-5 900 °C Dry methane heirs temperatures 
2.5wt.%MgO-modified Ni-SDC 800 °C Humidified (3% H20) methane | ~13.3%/330 hours High operating 54 
temperatures 
Ni- MnO/Ni-YSZ 800 °C Humidified (3% H20) methane | ~20.4%/24 hours High operating 52 
temperatures 


a, Comparison of stabilities of PCFCs and SOFCs on hydrocarbon/alcohol fuels from 2013 to 2017 (this work, and refs. 37-42); b, Summary of anode materials developed for direct hydrocarbon 


so FCs73738.43-52_ 
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Rapid energy-efficient manufacturing of polymers 
and composites via frontal polymerization 
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Thermoset polymers and composite materials are integral to 
today’s aerospace, automotive, marine and energy industries and 
will be vital to the next generation of lightweight, energy-efficient 
structures in these enterprises, owing to their excellent specific 
stiffness and strength, thermal stability and chemical resistance’. 
The manufacture of high-performance thermoset components 
requires the monomer to be cured at high temperatures (around 
180 °C) for several hours, under a combined external pressure and 
internal vacuum®. Curing is generally accomplished using large 
autoclaves or ovens that scale in size with the component. Hence 
this traditional curing approach is slow, requires a large amount 
of energy and involves substantial capital investment’. Frontal 
polymerization is a promising alternative curing strategy, in which 
a self-propagating exothermic reaction wave transforms liquid 
monomers to fully cured polymers. We report here the frontal 
polymerization of a high-performance thermoset polymer that 
allows the rapid fabrication of parts with microscale features, 
three-dimensional printed structures and carbon-fibre-reinforced 
polymer composites. Precise control of the polymerization kinetics 
at both ambient and elevated temperatures allows stable monomer 
solutions to transform into fully cured polymers within seconds, 
reducing energy requirements and cure times by several orders of 
magnitude compared with conventional oven or autoclave curing 
approaches. The resulting polymer and composite parts possess 
similar mechanical properties to those cured conventionally. This 
curing strategy greatly improves the efficiency of manufacturing 
of high-performance polymers and composites, and is widely 
applicable to many industries. 

Present technologies for manufacturing high-performance thermo- 
set and fibre-reinforced polymer composite (FRPC) parts rely on curing 
in large, expensive autoclaves or ovens. For example, curing a small 
section of the Boeing 787’s carbon fibre/epoxy fuselage is estimated 
to require 350 gigajoules (GJ) of energy during its eight-hour cure 
cycle, producing more than 80 tons of carbon dioxide’. Consequently, 
there has been much interest in producing these materials with less 
energy, reducing their cost and environmental impact and furthering 
their application in commercial markets®*"!!. Frontal polymerization 
is a promising curing strategy that substantially reduces manufactur- 
ing burdens by using the enthalpy of polymerization to provide the 
energy for materials synthesis, rather than requiring an external energy 
source!?. In frontal polymerization, a solution of a monomer and a 
latent initiator is heated locally until the initiator is activated for polym- 
erization of the monomer, producing heat from the polymerization 
that further drives the reaction. The autoactivation process produces a 
propagating reaction wave that rapidly transforms the available mono- 
mer into polymer. Frontal polymerization has been used to synthesize a 
variety of polymeric materials, including functionally graded polymers, 
nanocomposites, hydrogels, sensory materials and FRPCs'**'. Most 
of the materials used in frontal polymerization to date, however, are 


unsuitable for high-performance applications. For example, although 
acrylate monomers possess the requisite energy density and reactivity 
to frontally polymerize, the mechanical properties of the resulting 
polymers are greatly inferior to those used in structural FRPCs. By con- 
trast, epoxy monomers produce mechanically robust polymers, but are 
challenging to frontally polymerize because of their lower reactivity?”””. 
Moreover, it is essential for successful manufacturing of FRPCs that the 
liquid monomer be stable and essentially free of background polymeri- 
zation at room temperature. These requirements motivate the develop- 
ment of frontal-polymerization chemistry with a controllable and 
stable processing window, a high energy density and reactivity, and a 
mechanically and thermally robust polymer product. 

Here, we demonstrate that well-controlled frontal polymerization 
facilitates the rapid production of high-performance thermoset and 
FRPC parts with minimal energy input. Furthermore, the process is 
compatible with commonly used manufacturing techniques and pro- 
duces high-quality thermoset materials. Frontal curing of FRPCs is 
challenging because a high volume fraction of fibres is necessary to 
produce a composite material with good mechanical properties, yet 
the corresponding reduction in resin content reduces the exother- 
mic energy density available for frontal polymerization. As such, the 
frontal-polymerization chemistry must have a high molar enthalpy of 
polymerization and sufficiently high rate of polymerization to prevent 
frontal quenching. Fabricating small components with frontal polym- 
erization is similarly challenging because much of the heat of polym- 
erization is lost to the environment through air or tooling surfaces”>*. 

The frontal ring-opening metathesis polymerization (FROMP) of 
dicyclopentadiene (DCPD) using a thermally activated ruthenium 
catalyst exhibits the high energy density, high reactivity and low vis- 
cosity required for the synthesis of high-performance thermosets 
(Fig. la). The resulting polydicyclopentadiene (pDCPD) is a cross- 
linked thermoset polymer that is suitable for the fabrication of durable 
resin and FRPC parts, owing to its high fracture toughness, impact 
resistance, stiffness and chemical resistance?*-?”. However, FROMP 
chemistry has been severely limited in the past by its short pot life of 
less than 30 minutes”®”*. Recently, we demonstrated that inhibitors of 
the alkyl phosphite family substantially extend the room-temperature 
liquid-processing window for FROMP of DCPD up to 30hours”. Here, 
we use phosphite-inhibited FROMP of DCPD to efficiently fabricate 
neat pDCPD and carbon FRPC structures. Compared with conven- 
tional curing, our frontal-polymerization strategy reduces energy 
requirements by more than ten orders of magnitude for large compo- 
nents (Fig. 1b). 

Phosphite-inhibited DCPD containing second-generation Grubbs’ 
catalyst (GC2) slowly transforms at room temperature from a liquid 
to a viscoelastic gel. Remarkably, the gelation of the monomer does 
not result in concomitant spontaneous polymerization, as observed 
with previous FROMP chemistries. Tuning the inhibitor concentration 
allows access to a range of rheological profiles between the low-viscosity 
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liquid and free-standing elastomeric gel, all of which frontally poly- 
merize upon thermal activation (Fig. 1c). These free-standing gels are 
deformable and easily embossed; the resulting patterned gel structure 
can be quickly fixed into a rigid pDCPD structure via FROMP (Fig. 2a—c 
and Supplementary Video 1). Moreover, when the liquid monomer 
is poured onto a micropatterned substrate, peeled off in the gel stage, 
and polymerized by FROMP, a high-fidelity replica is produced in one 
step (Fig. 2d, e). 

The gel is amenable to three-dimensional (3D) printing during the 
high-viscosity fluid stage, whereby the viscous liquid is extruded from 
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Fig. 1 | Overview of the frontal-polymerization concept. a, Scheme for 
the FROMP of DCPD using a ruthenium catalyst (Ru; that is, second- 
generation Grubbs’ catalyst) and an alkyl phosphite inhibitor (P(OR)3). 

b, Comparison of the energy required for curing versus the time taken for 
curing via frontal polymerization and conventional curing. Given that the 
only energy needed for frontal curing is the initiating stimulus, the process 
requires several orders of magnitude less energy than conventional curing, 
while also occurring faster and with less expensive equipment. References 
7 and 11 describe the manufacturing of a section of the Boeing 787 fuselage 
and a 900 cm? carbon FRPC panel, respectively. We calculated that the 
curing of a section of the Boeing 787 fuselage by FROMP (red square) 
reduces the energy consumption by ten orders of magnitude compared 
with conventional techniques. c, In the technique used here, the FROMP 
solution is triggered to polymerize in its liquid stage, or allowed to form a 
gel at room temperature and later activated. In both cases, a rapid FROMP 
reaction transforms the liquid or gel into a durable thermoset polymer. 


a print head and frontally polymerized immediately upon exiting the 
nozzle, allowing for the simultaneous free-form printing and curing of 
thermoset polymers (Fig. 2f and Supplementary Video 2). Matching the 
print-head velocity to the front velocity allows for the manufacture of 
free-form complex architectures that are not possible with traditional 
additive manufacturing approaches (Fig. 2g, h). Once printing is com- 
plete, the part is fully cured (Extended Data Fig. 1c) and there is no 
need for further processing. 

In marked contrast with traditional autoclave processing, we can 
fabricate FRPC parts in less than 5 minutes through FROMP of woven 
carbon fibres infused with the monomer solution (Fig. 2i, j). The very 
low viscosity of the liquid monomer at room temperature (~1.5 cP) 
allows the resin to be rapidly infused into continuous-fibre layups 
that have a high volume fraction of fibres via out-of-autoclave pro- 
cessing techniques such as vacuum-assisted resin-transfer moulding 
(VARTM). FROMP is triggered by briefly powering a resistive heating 
wire embedded in the layup, which provides sufficient thermoelectric 
stimulus (Fig. 3a and Supplementary Video 3). The high reactivity of 
DCPD facilitates FROMP in thin FRPC laminates with up to 50 vol% 
fibre reinforcement, while remarkably reducing the manufacturing 
time and required energy. We can further reduce the manufacturing 
time by using multiple triggering points (Fig. 3b and Supplementary 
Video 4) or by propagating the reaction through the thickness using 
a resistive heater underneath the layup (Fig. 3c and Supplementary 
Video 5). However, commensurately more triggering energy is required 
in these configurations (Fig. 3a-c). We expected the relatively low 
volume fraction of monomers in the FRPCs to reduce the frontal 
velocity. Surprisingly, however, the frontal velocity in the FROMP-FRPC 
(9.8cm min7!) was higher than that observed in neat resin for the 
same formulation (7.5cm min~'). We surmise that the carbon fibre 
conducts heat from the exothermic reaction and preheats a region of 
monomer ahead of the front, accelerating the reaction and producing 
a higher frontal velocity. 

Numerical simulation of a carbon-fibre bundle, or tow, in the resin 
supports this hypothesis. Using a simple model system—which consists 
of a single carbon-fibre tow made of 3,000 individual fibres suspended 
in neat resin—we observe that the carbon fibres accelerate FROMP 
and produce a distinct change in the front shape from flat to conical 
(Fig. 3d, e). To provide analytical insight, we carried out numerical 
simulations of the thermochemical problem using a transient, non- 
linear finite-element solver. We used an adaptive meshing scheme to 
capture the sharp gradients in temperature and degree of cure in the 
immediate vicinity of the propagating polymerization front. As shown 
by the computed thermal solutions in Fig. 3d, e, the higher thermal 
conductivity of the fibre tow changes the thermal field ahead of the 
front, thereby modifying the shape and speed of the front. The numer- 
ical solver provides useful insights into FROMP by creating a direct 
link between the cure kinetics model and FROMP propagation. For 
example, simulations of multipoint initiation predict a temperature 
spike upon front convergence, because the heat of reaction is no longer 
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conducted away from the front. This thermal overshoot is observed in 
two-point initiation experiments (Fig. 3b), and, if excessive, may lead to 
material degradation near the location of merging fronts. Optimization 
strategies for the manufacture of FRPCs with minimum curing time 
and required energy coupled with high quality will require similar com- 
putational modelling approaches. 


a In-plane FROMP: one wire b In-plane FROMP: two wires © Through-thickness FROMP 
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Fig. 2 | Advanced manufacturing with 
FROMP. a, FROMP in a free-standing 

gel, propagating radially from a single 
initiation-point source. b-e, Macropatterned/ 
micropatterned pDCPD produced by gel 
FROMP. In b, a flat gel sheet is rolled into a 
helix structure before FROMP. Inc, a flat gel 
sheet is imprinted with an ILLINOIS stamp 
before FROMP. In d and e, gel sheets are 
produced by moulding at room temperature 
for 18 hours and followed by FROMP to 
solidify the micropattern. f, 3D printing of gel 
DCPD solution that is solidified by FROMP 
immediately following extrusion from the print 
head. g, h, Free-form 3D-printed structures 
produced via FROMP. i, A corrugated carbon 
FRPC part fabricated by FROMP using 
vacuum-assisted resin-transfer moulding. 

j, A 900 cm? carbon FRPC panel (with 51% 
fibre volume fraction) cured by FROMP in 

5 minutes using about 750] of energy. 


Carefully selecting the phosphite inhibitor and its concentration ena- 
bles us to control room-temperature rheological profiles as well as fron- 
tal velocity (Fig. 4a). Figure 4b divides the rheological characteristics of 
a particular formulation into three regimes. Prior to the gel point, liquid 
FROMP is suitable for a wide variety of infusion techniques that require 
a low-viscosity resin. After the gel point, the material exhibits increased 
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Fig. 3 | Composite fabrication and modelling. 
The FROMP curing of FRPC is a versatile 
process. a, Heating one embedded wire ignites a 
single front that propagates (left to right) to fully 
cure the FRPC in about 2 minutes. b, Heating 
two embedded wires (opposite ends) ignites 
two fronts that merge in the middle, completing 
curing in half the time (about 1 minute). 

c, Heating the layup from below ignites a front 
that travels through the thickness of the panel 
and completes the cure in about 30 seconds. 

Still images are captured using a thermal 
infrared camera. Scale bar in a—c represents 
5cm. d, A propagating FROMP reaction wave 
captured experimentally and via simulation. The 
optical difference between the solid and liquid 
phases shows that the front shape is slightly 
convex in the direction of propagation (left to 
right). This frontal shape is closely matched by 
simulation. e, In the presence of a single carbon- 
fibre tow, the frontal shape is altered and the 
front is accelerated near the tow surface. The 
increase in front speed from d to e is roughly 
10%. Scale bar in d, e represents 4mm. 


10 MAY 2018 | VOL 557 | NATURE | 225 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


as b 107 500 # 
-7 = TMP 406 | Liquid FROMP ' Gel FROME: No FROMP ios 
a e TEP ! 3D printing | propagation | 499 & 
=e @ TBP z 
— 10°48 = I I Q 
£ co | "4 a 
65 oy i} L300 9 
@ 10° t j ek 
24 3 ! T a) 
8 3 103 1 t ' S 
2, B 10° | } | L200 '§ 
2 = i ' g 
Bo 10? | | of 
5 ‘@ L100 & 
a | + 1o'4 Te S 
! = Residual heat ce 
10° r —+— T T r r @ 
0 5 10 15 20 25 30 35 40 45 0 2 4 = 
Gel time (h) 
c d e 
_ 30,23 Young's modulus 60 3.5 maYoung’s modulus 
i) OC Tensile strength 4 . os CrTensile strength a 
oO o # 3.0 o 3 
6 2.5 503 3 oe) 2 
g 2 2f05 £ 2 
a no 
SB 20 40° Se 3 ® 
3 a $520 3 Fl 
uel + Oo8t cel oO 
° 1.5 308 $a fs) =| 
€ Q@ e515 € =a 
2 S$ 35 o = 
> 1.0 ae a 10 $ S 
2 0.5 105 Kc O05 2 s 
> £ 2 
0.0 0.0 0 
Liquid Gel Oven cure Oven cure Liquid Gel Oven cure Oven cure Conventional FROMP Conventional 
FROMP FROMP DCPD_ BPA epoxy FROMP FROMP  DCPD BPA epoxy cure DCPD cure 
DCPD DCPD DCPD DCPD DCPD BPA epoxy 
Resin cure method Resin cure method Composite production method 


Fig. 4 | Characterization of the FROMP curing approach. a, Gel time 
and frontal velocity are controlled by the identity of the phosphite 
inhibitor (TMP, trimethyl phosphite; TEP, triethyl phosphite; TBP, tributyl 
phosphite) and its concentration (0.3, 0.5, 1, 2, 4 or 8 molar equivalents 
with respect to GC2 catalyst). FROMP is no longer observed with 9, 12 

or 14 molar equivalents of TMP, TEP and TBP, respectively. Error bars 
represent standard deviation from the mean (n = 3). b, Representative 
rheological and thermal behaviour for curing at room temperature (23 °C) 
of a FROMP solution containing 1 molar equivalent TBP inhibitor with 
respect to GC2 catalyst (highlighted yellow data point in a), showing 

three rheological regimes. The liquid FROMP stage is suitable for 
resin-infusion/transfer manufacturing. After gelation, both 3D printing 
and the macropatterning/micropatterning of gels are possible. After 
vitrification, FROMP is no longer possible. The residual heat at each stage 
is determined by DSC analysis. The initial data point at t= 0.19 hours 


elasticity, enabling access to additional processing techniques. During 
the first part of the post-gelation regime, the monomer solution is a vis- 
cous liquid suitable for 3D printing (vide supra). It exhibits shear thin- 
ning behaviour (Extended Data Fig. 1d) and is readily extruded from a 
print head under pressure. In the latter stages of this regime, the mate- 
rial forms a free-standing gel that is resistant to plastic flow. This stage 
is useful for casting micropatterns and imprinting (vide supra). The 
regime ends with a secondary vitrification transition that corresponds 
to the point at which there is no longer sufficient chemical energy in 
the reactive formulation to sustain FROMP. For larger concentrations 
of tributyl phosphite (TBP), this transition occurs more than 40 hours 
after mixing at 23 °C, providing sufficient time for manufacturing. For 
stiff gel applications, a trimethyl phosphite (TMP) formulation that 
gels quickly may be desirable. For 3D printing and composite manu- 
facturing, a longer processing window is desired; therefore, TBP or 
triethyl phosphite (TEP) should be used at the highest concentration 
that still enables FROMP. 

We monitored the residual exotherm by differential scanning 
calorimetry (DSC) during room-temperature curing, to examine the 
effect of gelation on the cure profile and to correlate the rheological 
transitions with the degree of cure (Fig. 4b). We found that there is a 
slight shift of the exothermic peak to lower temperatures as the curing 
proceeds, and that the peak broadens substantially (Extended Data 
Fig. 1a). This behaviour suggests that the reactive formulation becomes 
more sensitive to increases in temperature during gelation; this sen- 
sitivity facilitates frontal polymerization in the partially cured state. 
The increase in reactivity may result from the formation of new cata- 
lyst species in situ, as seen in *'P nuclear magnetic resonance (NMR) 
spectroscopy of the gelled material*”. The pDCPD produced after both 
liquid and gel FROMP exhibits a minimal residual exotherm, indicating 
that a complete cure is achieved (Extended Data Fig. 1b). 
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indicates the total enthalpy of the reaction of dicyclopentadiene 

(H, = 362.8 + 13.1J g'). Error bars represent standard deviation from 
the mean (n =3). G’, storage modulus; G”, loss modulus. c, Young’s 
modulus and tensile strength for liquid FROMP, gel FROMP, and oven- 
cured pDCPD. A bisphenol A (BPA) aerospace-grade epoxy is shown 

for comparison. Error bars represent standard deviation from the mean 
(n=7). d, Mode I fracture toughness (Kjc, where K is the stress intensity 
factor, I denotes ‘mode I crack opening; and C denotes ‘critical’) for liquid 
FROMP (n=5), gel FROMP (n =5) and oven-cured pDCPD (n=6) in 
comparison with BPA aerospace-grade epoxy (n = 8). Error bars represent 
standard deviation from the mean. e, Young’s modulus and tensile strength 
for conventional-cured (51% volume fraction; n = 10) and FROMP- 

cured (51% volume fraction; nm = 13) pDCPD-FRPCs in comparison with 
aerospace-grade (BPA) carbon/epoxy FRPC (52% volume fraction; n = 8). 
Error bars represent standard deviation from the mean. 


FROMP-cured pDCPD exhibits comparable mechanical proper- 
ties to conventionally (oven) cured pDCPD. The tensile strength and 
stiffness of pDCPD that has been frontally polymerized from both 
the liquid and the gel stage indicate that frontal polymerization pro- 
vides the energy necessary for complete cross-linking of the monomer 
(Fig. 4c). Comparing the tensile properties of pDCPD with those of an 
aerospace-grade bisphenol A (BPA) epoxy system indicates that this 
new curing strategy and matrix resin produce polymers suitable for 
use in high-performance applications. In fact, the enhanced fracture 
toughness of FROMP-pDCPD compared with BPA epoxy (Fig. 4d) may 
provide high-performance composites with superior energy absorption 
and fatigue resistance”’. 

Frontally polymerized FRPCs also exhibit mechanical properties 
comparable to those of FRPCs manufactured using a conventional 
aerospace-grade BPA epoxy system (Fig. 4e). The carbon fibre used in 
our FRPCs was surface-treated to allow interaction with epoxy func- 
tional groups; we envisage that greater tensile properties for pDCPD- 
FRPCs should be made possible by tailoring the interfacial interaction 
of the carbon fibre and pDCPD through proper fibre surface treat- 
ments. We attribute the slightly lower tensile strength of FROMP-cured 
compared with conventionally cured pDCPD composite specimens to 
the lower degree of cross-linking, resulting from heat loss through the 
fibre reinforcement and tooling (Extended Data Table 2). 

In conclusion, we have developed and demonstrated a new frontal 
polymerization curing method for the rapid, energy-efficient manu- 
facturing of thermoset pDCPD and FRPCs. This method is applicable 
to moulding, imprinting, 3D printing and resin-infusion techniques, 
and the resulting polymers and carbon-fibre-reinforced polymer 
composites exhibit excellent mechanical properties, comparable to 
those of materials produced by conventional techniques. We envisage 
that the controlled frontal polymerization of high-quality resins will 
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enable a variety of new manufacturing technologies—such as on-site, 
on-demand manufacturing, in-the-field repair of FRPCs and mould- 
less production—owing to the spatiotemporal control over the curing 
process. 
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Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 

Materials. Dicyclopentadiene (DCPD), 5-ethylidene-2-norbornene (ENB), 
second-generation Grubbs’ catalyst (GC2), phenylcyclohexane, and phosphite 
inhibitors (TMP, TEP and TBP) were purchased from Sigma-Aldrich and used 
as received without further purification. For all FRPC specimens, the fibre rein- 
forcement is Toray T300 carbon fibre 2 x 2 twill weave fabric (tow size 3,000; 
areal density 204g m~*). The resistive heating wire used to trigger FROMP in 
the manufacturing of neat resin panels and in FRPC manufacturing is a 26-gauge 
Kanthal wire (diameter 0.40 mm; resistivity 1.4 x 10-4 Q cm). 

Frontal polymerization. Given that DCPD is solid at room temperature, we first 
melt it in an oven at 35 °C and then add 5 wt.% ENB to depress the melting point. 
All references to DCPD herein refer to this 95/5 DCPD/ENB solution. This mixture 
is then degassed at 16 kPa overnight. In a typical experiment, we weigh out 3.21 mg 
GC2 into an Eppendorf tube and dissolve it in 400 jl phenylcyclohexane. We add 
an appropriate amount of phosphite inhibitor (0-14 molar equivalents with respect 
to GC2) to the solution via a volumetric syringe. The catalyst/inhibitor solution 
is then added to 5g DCPD (10,000 molar equivalents with respect to GC2) and 
thoroughly mixed. 

Unless otherwise specified, the solution is frontally polymerized immediately 

after mixing. Different types and concentrations of inhibitor are used for each 
manufacturing technique to tune the reaction kinetics based on the requirements 
of the target application. The various inhibitor concentrations and resin incubation 
times are summarized in Extended Data Table 1. 
FROMP of stiff gel. We use TBP (2 molar equivalents with respect to GC2) as the 
inhibitor for the free-standing gel specimens shown in Fig. 2a—c. The solution is 
poured into a flat glass plate mould with a polyurethane rubber gasket and allowed 
to polymerize for 18 hours at room temperature. The glass plates are then carefully 
removed and the free-standing gel is extracted from the mould. Gels are then 
either deformed by hand (Fig. 2a, b) or embossed with a plastic stamp (Fig. 2c) to 
create the pattern. A soldering iron is used to locally heat a single point on the gel, 
initiating the FROMP reaction. 

For micropatterned specimens (Fig. 2d, e), a patterned silicon wafer is attached 

to a microscope glass slide using cyanoacrylate adhesive. A second microscope 
slide is used with a polyurethane rubber gasket to form a small glass plate mould. 
The same chemistry described previously for free-standing gel experiments is used 
to fill the mould and the solution is allowed to polymerize for 18 hours. The gel is 
carefully peeled off the micropatterned surface and examined via optical micros- 
copy to confirm pattern transfer. The gel is then suspended pattern-side down 
between two microscope slides and FROMP is initiated by applying a soldering 
iron to one end of the sample. 
Front velocity and temperature measurements. We measure the peak tempera- 
ture during FROMP by inserting a T-type thermocouple (TMQSS, Omega) into 
the liquid resin, gel or FRPC specimens before initiating FROMP. For experiments 
performed in cylindrical test tubes, we insert the thermocouple into the centre 
of the test tube to reduce heat-transfer effects at the test-tube wall. Similarly, for 
experiments performed in composite layups, the thermocouple is inserted at the 
mid-plane of the layup. 

Frontal propagation is tracked with a Canon EOS 7D digital camera or an FLIR 
SC620 thermal infrared camera. Front velocity is extracted from the slope of the 
best-fit trendline for front position versus time. We identify the front position by a 
colour change and refractive-index mismatch in optical images, or a sharp thermal 
gradient in infrared images. 

Rheological measurements. Isothermal rheological measurements are performed 
using a TA Instruments AR-G2 rheometer equipped with 25-mm-diameter parallel 
aluminium plates and a solvent trap. An appropriate amount of phosphite inhi- 
bitor (0-14 molar equivalents with respect to GC2) is dissolved in the monomer 
solution. Time-sweep measurements are performed at 23 °C with a strain of 0.1% 
and a frequency of 1 Hz. The gel times plotted in Fig. 4a correspond to the crossover 
of the storage modulus, G’, and loss modulus, G”, as shown in Fig. 4b. For the 
shear thinning behaviour of the 3D printable DCPD ink (Extended Data Fig. 1d), 
viscosity and stress data are obtained in a flow sweep test with descending shear 
rate at a fixed temperature of —5 °C. 

Heat of reaction and degree of cure analysis. We carry out DSC measurements on 
a TA Instruments Q20 DSC equipped with a CFL-50 cooling system. Samples are 
transferred into aluminium hermetic DSC pans at room temperature and sealed. 
The sample mass is determined using an analytical balance (XPE205, Mettler- 
Toledo). The mass of liquid resin samples is carefully maintained between 2 mg 
and 3 mg because of the highly exothermic nature of the studied reaction, as greater 
masses exceed the instrument's ability to maintain a consistent temperature gradient. 
We determine cure profiles of liquid resins and gels at temperatures between 
—50 °C and 250 °C with constant ramp rates. The enthalpy of reaction is deter- 
mined through the integration of heat flow over the exothermic peak after baseline 
correction. The specific heat capacity is determined between 25 °C and 200 °C by 
comparison with a sapphire standard. 


3D printing. A DCPD/GC2 solution containing 0.5 molar equivalents of TEP 
with respect to GC2 is transferred into a 3 ml syringe barrel and left to stand at 
23.0 °C in an environmental test chamber (MicroClimate, Cincinnati Sub-Zero 
Products) for 160 minutes to allow the cross-linking of the ink mixture to form a 
viscous liquid. Ink in the syringe barrel is then sealed with a piston and is ready 
for printing. The 3D printer consists of an air-operated high pressure dispens- 
ing tool (HP 3CC, Nordson EFD) mounted on a robotic motion-controlled stage 
(JL2000, Robocasting Enterprises). The pressure actuation and stage motion is 
simultaneously controlled via custom-designed software (RoboCAD 2.0). Air is 
supplied from a compressed air tank (Airgas) using a customized pressure con- 
troller. The print bed consists of an aluminium plate heated by a polyimide film 
heater (KH-608, OMEGA) to 70 °C. Glass slides are placed on the print bed to 
capture the prints. The barrel housing is fitted with stainless-steel dispenser tips 
before printing (inner diameters between 0.25 mm and 1.55mm; Nordson EFD). 
The ink barrel is inserted into the dispensing tool, then cooled down to —5 °C + 3 °C 
by surrounding the dispensing tool with dry ice. A few seconds after DCPD ink is 
printed on the heated glass slide, the front initiates via the heated print bed. The 
front then propagates along the filament, following the print head during printing. 
The shear thinning behaviour of the gel (Extended Data Fig. 1d) is crucial for the 
3D printing of free-form structures. Using DSC, we estimate the degree of cure of 
the unprinted ink and printed specimen to be about 23.6% and 99.2%, respectively 
(Extended Data Fig. 1c). 

Fabrication and testing of neat resin panels. We use a resin solution contain- 
ing 1 molar equivalent of TBP with respect to GC2 for the fabrication of both 
FROMP-pDCPD and conventionally cured pDCPD resin panels. The resin 
mixture is degassed for 10 minutes at 10 kPa and then poured into cell casting 
moulds. We use a 215.9mm x 203.2mm x 6.4mm U-shaped polyurethane 
spacer between two glass plates for tensile testing specimens, and a 139.7 mm 
x 127.0mm x 9.5mm U-shaped polyurethane spacer between two glass plates 
for fracture toughness specimens. Panels are manufactured using liquid FROMP, 
gel FROMP, and conventional (oven) cure. Liquid FROMP panels are initiated 
immediately by applying an electric current to a resistive wire placed along one 
edge of the mould. The power supply is turned off as soon as frontal propagation 
is observed, and remains off for the remainder of the propagation. We keep 
gel FROMP panels at 23 °C for approximately 6 hours, at which time the resin 
becomes a rubbery gel, before initiating the FROMP via a resistive wire placed 
along one edge of the mould. The power supply is turned off as soon as frontal 
propagation is observed, and remains off for the remainder of the propagation. 
Conventional-cured panels are cured for 24hours at 30 °C, 2 hours at 70 °C and 
1.5 hours at 170 °C. 

For comparison, we also manufacture panels of an aerospace-grade bisphenol 
A (BPA) epoxy resin. A solution of Araldite LY 8605 (resin) and Aradur 8605 
(hardener) is mixed (100/35 weight ratio) and then degassed for 1 hour at an 
absolute pressure of 10 kPa. The epoxy system is poured into cell casting moulds 
similar to those used for the pDCPD panels. Samples are cured according to the 
manufacturer’s recommended cure cycle, which is 24 hours at room temperature, 
2hours at 121 °C and 3 hours at 177 °C. 

We cut dog-bone specimens for tensile testing from the manufactured panels 

following ASTM standard D638 type I dimensions. Tensile tests are conducted 
using an Instron 5984 with a 150KN load cell and a video extensometer is used 
to measure the strain. Tests are performed at a crosshead speed of 5mm min“!. 
We calculate Young’s moduli over a range of strains from 0.1% to 0.3%. Single- 
edge-notch bending (SENB) specimens for fracture toughness experiments are 
machined from the panels following ASTM standard D5045. Fracture toughness 
tests are conducted using an Instron 8841 with a 1 KN load cell. Pre-cracks are 
created by tapping a razor blade into the edge notch of the sample with a hammer. 
We measure crack lengths optically, using an optical digital microscope (VHX- 
5000, Keyence). A three-point bending fixture is used with a 12 cm span, and tests 
are conducted at a crosshead speed of 10 mm min™!. We calculate Kjc according 
to the referred standard. 
Fabrication and testing of composite parts. For all composite specimens, we 
use 12 plies of dry fabric. For FROMP-pDCPD composites, we mix 0.3 molar 
equivalents of TBP with respect to GC2 with the resin solution. A double-bagged 
VARTM technique is used to infuse the fabric with the liquid resin. To minimize 
heat loss during cure and to mitigate quenching of the front, we prepare the layup 
on a thermally insulating tool plate (448-D, Fibre Glast Developments Corp.). 
A low vacuum (64.3 kPa) is applied on the inner bag using a vacuum pump (2047B- 
02, WelchTM DryFastTM) to infuse the resin. A 1.7kPa vacuum is applied on 
the outer bag using a second vacuum pump to compact the layup for producing 
cured parts with a high volume fraction of fibres. To initiate FROMP, we place an 
electrically resistive wire within the fabric stack and connect it in series toa DC 
power source. Once resin completely infuses the fabric stack, we apply a constant 
electric current across the resistive wire until FROMP initiates in the layup, and 
then turn the power off. 
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We evaluate the effect of three different curing modes on total cure time for a 
10cm x 20cm laminate (Fig. 3a—c). For the in-plane curing modes (Fig. 3a, b), a 
DC power supply powers the embedded wires for 20 seconds at 19 W per wire. For 
the through-thickness curing mode (Fig. 3c), an AC power supply (L1010, Staco 
Energy Products Co.) powers the surface heater (SRFG-408/10, OMEGALUX) for 
32 seconds at a maximum rating of 320 W. We use an FLIR SC620 thermal infrared 
camera to record the surface temperature of the layup and to measure the total cure 
time for each mode. Given that we carry out thermal imaging on the surface of 
the FRPC layup, we also embed a thermocouple inside the layup to measure the 
actual temperature of the FRPC during manufacturing. We found infrared readings 
to be lower than the actual laminate temperature by around 30 °C, owing to heat 
dissipation and radiation by the layup materials. 

We prepare the corrugated composite panel (Fig. 2i) by stacking 12 plies of 
13cm x 23cm fabric on a custom metal tool plate. To minimize heat loss to the 
thermally conductive metal tooling during cure, we preheat the entire layup to 
50 °C ina convection oven before resin infusion. The layup is then removed from 
the oven and immediately infused with resin. Finally, FROMP is triggered using 
the thermal stimulus via the embedded resistive wire. 

We manufacture conventionally cured pDCPD and epoxy composite panels as a 
control, for comparing the mechanical properties of the FROMP-cured composites. 
We fabricate these panels using a traditional wet-layup technique and then cure 
them in a hydraulic press (MTP-13, Tetrahedron) under constant applied platen 
force of 13.3 kN. The matrix material for the conventional-cured pDCPD-FRPCs is 
a resin formulation with a pot life of at least one hour (1.0 molar equivalents of TBP 
with respect to GC2). We use the same cure cycle as for the conventionally cured 
neat pDCPD panels. The matrix material and cure cycle for the epoxy composites 
are the same as for the neat epoxy panels described previously. 

We determine the void content of the fabricated laminates by polishing the 
cross-sections of five to eight specimens taken from each FRPC panel, and imaging 
them using an optical digital microscope (VHX-5000, Keyence). These images are 
then analysed using ImageJ software to calculate the total void area to cross- 
sectional area ratios for each specimen. The void volume fraction for each panel is 
then taken as the average void volume fraction across the analysed specimens. The 
; A, where fa is the 

ps 

areal weight of fabric, n is the number of plies, p¢ is the fibre density and t is the 
average laminate thickness. The quality of different FRPC panels is summarized 
in Extended Data Table 2. 

For the tensile tests, we tab carbon FRPC panels with G10 FR4 glass-fibre/epoxy 
end tabs that are 44.5 mm long, 0.20 mm thick and with a 20° taper. We roughen 
the surfaces of the end tabs and panels at the adjoining areas by lightly sandblasting 
them to increase mechanical grip, and adhere them to each other using JB Weld 
adhesive. The panels are then cut into tensile specimens with nominal dimen- 
sions of 203.2mm x 12.7mm. We test composite specimens according to ASTM 
standard D3039 with a crosshead speed of 2mm min™! on an Instron 5984 with 
a 150 kN load cell and use an Instron video extensometer to measure the strain. 
Young’s moduli are calculated over a strain range from 0.3% to 0.6%. 
Computational modelling. In an axisymmetric setting, the governing partial 
differential equations describing the FROMP reaction can be written as: 


fibre volume fraction, Vg of FRPC panels is calculated as V; = 


JOE MOT OE) 5. op OR. pF 

ar ror ae)” tar ae 
ae Acs| =a a)"a"™ : ; 
T 1+ exp[C(a—a,)] 


where T (in K) and a (non-dimensional) respectively denote the temperature and 
the degree of cure; « (in W m! K7!) denotes the thermal conductivity; C, (in 
J kg! K~!) is the specific heat; p (in kg m~3) denotes the density; H, (in J kg~1) 
is the total enthalpy of the reaction; r and z (in m) are the radial and longitudinal 
coordinates; and f (in s) is the time. The second equation corresponds to the cure 
kinetics model, with A (8.55 x 10! s~1), E (110.75kJ mol7!), R (8.314J mol! K~}), 
n (1.7), and m (0.8) respectively denoting the pre-exponential factor, the acti- 
vation energy, the universal gas constant, and two constants associated with the 
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Prout-Tompkins model that account for the autoactivating effects. The model 
also includes an added diffusion factor with C= 14.5 and a, =0.4 to describe the 
cure kinetics of the monomer along with the effects of diffusion*!. We calculated 
the above constants by solving a nonlinear model-fitting optimization problem on 
the heat of reaction curves obtained by dynamic DSC experiments on a formu- 
lation containing 0.5 molar equivalents of TBP with respect to GC2 catalyst. The 
above governing equations are completed by the following initial and boundary 
conditions: 


T(r,z,t=0)=Th 
a(d0<r<a,z,t=0)=ay 


{T<r<a,z=0,t)=Tyig OXt<t 


rig trig 


SPOS tS m2=0,)=0 t> brig 


Zz 
while adiabatic conditions are imposed at all other boundaries. Tp (23 °C) and ao 
(0.07) are respectively the initial temperature and the degree of cure of the mono- 
mer solution. Tyg is the trigger temperature. The physical and thermal properties 
used in this study are listed in Extended Data Table 3. 

We conducted the numerical analysis using the Multiphysics Object-Oriented 
Simulation Environment (MOOSE)*’, an open source C++ finite-element solver 
that includes mesh adaptivity capability. The system of nonlinear partial differ- 
ential equations is solved at every time step using a combination of the implicit 
Euler time-stepping scheme and a preconditioned Jacobian-free Newton Krylov 
method??*4. 

The first simulation presented in Fig. 3d involves the frontal polymerization of 
DCPD inside a glass tube (Extended Data Fig. 2a). The domain dimensions are 
length (L) =7.5 mm, inner radius (a) =5.5 mm and wall thickness (b) = 1 mm, with 
L chosen to be long enough to capture the quasi-steady-state propagation of the 
front. We use 151,470 four-node quadrilateral elements to discretize the domain 
at the beginning of the simulation, and apply a maximum refinement level of 9 
to adapt the mesh in the vicinity of the advancing front. The polymerization is 
initiated by applying a Trig of 210 °C, for a tirig of 7 seconds, along the left edge of 
the domain (Extended Data Fig. 2a). Extended Data Fig. 2b, c depict the associated 
temperature contours during polymerization. The simulation yields a maximum 
temperature, Tinax, of 223 °C and a velocity of the polymerization front, V, of 64cm 
min |, in good agreement with the experimental results. 

To capture the effect of a carbon-fibre tow as a conductive element on the propa- 
gation of the front, we perform a second simulation (Extended Data Fig. 2d), with 
tow radius (c) =0.5mm, and a, b and L the same as for the case without the tow. For 
this problem, the same mesh is used at the beginning of the simulation; however, 
a maximum refinement level of 12 is applied to capture the exceptionally sharp 
gradients associated with the large mismatch in thermal conductivities between the 
carbon-fibre tow and the DCPD resin (Kce/ resin = 68.75). The reaction is initiated 
by applying a Tyig of 210 °C, for a tirig of 7 seconds. Extended Data Fig. 2e, f, which 
present snapshots of the temperature contours during the frontal-polymerization 
event, clearly shows the effect of the conductive element on the front profile. The 
computed front velocity is 10% higher than in the neat resin case. 

Data availability. The data that support the findings of this study are available 
from the corresponding author on reasonable request. 

Code availability. We performed the simulations using MOOSE, an open-source 
C+-+ finite-element framework developed at Idaho National Laboratory (http:// 
mooseframework.org). 
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Extended Data Fig. 1 | Characterization of DCPD gel and pDCPD approaches, indicating fully cured products with a degree of cure of 99.6%, 
products. a, Heat of reaction, measured by DSC, for the formulation 99.6% and 99.7%, respectively. c, Heat of reaction of the gel before 3D 
presented in Fig. 4b after ageing the resin at 23 °C for the indicated times. printing and of a cured part after printing, measured by DSC. There is 
As the curing proceeds, the exothermic peak shifts slightly to lower minimal heat of reaction in the printed polymer, indicating a 99.2% degree 
temperatures and broadens. b, Heat of reaction of pDCPD specimens of cure. d, Rheological profile of the 3D printable gel, showing shear 
produced by liquid FROMP, gel FROMP and conventional (oven) cure thinning behaviour. 
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Extended Data Table 1 | Inhibitor concentration and resin incubation time for different manufacturing techniques and the corresponding 
front temperature and velocity 


Inhibitor 
concentration 
(molar equiv. 
to GC2) 


Resin incubation 
time 


Front temperature’ 
(°C) 


Front velocity” 
(cm min”) 


~ 215 (0.3-1 equiv.) 


Manufacturing technique ee 
type 
Liquid resin (gel time aes 
measurements) TBP 
Stiff gel FROMP TBP 
3D printing TEP 
FROMP-pDCPD neat panel TBP 
(liquid FROMP) 
FROMP-pDCPD neat panel TBP 
(gel FROMP) 
Conventional-cured pDCPD TBP 
neat panel 
FROMP-pDCPD FRPC TBP 
Conventional-cured pDCPD TBP 


FRPC 


0-14 


0.3 


<1 min 


18 h @ room temp. 
160 min @ 23.0 °C 


<1 min 
6h 

<1 min 
<5 min 


<5 min 


~ 200 (2-4 equiv.) 
~ 175 (8 equiv.)" 
145 


170* 


216 
184 


N/A 
138 


N/A 


See Fig. 4a 


3.6 
9.6 


6.8 
5.7 


N/A 
9.8 


N/A 


TMP: trimethyl phosphite; TEP: triethyl phosphite; TBP: tributyl phosphite. 
“Front temperatures and velocities are highly dependent on the material and geometrical conditions used for each manufacturing technique. The values reported here correspond to the exact 


experimental conditions described in the Methods section. 


tFront temperatures for a given concentration of phosphite inhibitor are similar regardless of the identity of the inhibitor. 
+The front temperature of the 3D printed filament is recorded using a thermal infrared camera, however, it is difficult to determine the exact temperature of the front due to the low distance between 


the heated substrate and printed material. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Table 2 | Comparison of FRPC panels made with different manufacturing techniques 


Manufacturing technique Void content (%) Vs(%) Degree or cure 


(%) 
FROMP-pDCPD FRPC 0.15 + 0.20 513210 805239 
Conventional-cured pDCPD FRPC 1.30 + 0.56 SLIAL3 89.6213 
Conventional-cured epoxy FRPC 0.25 £0.15 S2.221.2 98.824 1.7 
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Extended Data Table 3 | Physical and thermal properties of the various components used in computational modelling 


k(Wm"'K") p(kgm*) A, g') CJ kg' K") 


Monomer 0.15 980 350 1,600 
Carbon fibre 10.45 1,760 _ 795 
Glass 1.14 2,230 _ 800 
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Nanoscale synthesis and affinity ranking 
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Most drugs are developed through iterative rounds of chemical 
synthesis and biochemical testing to optimize the affinity of a 
particular compound for a protein target of therapeutic interest. This 
process is challenging because candidate molecules must be selected 
from a chemical space of more than 10 drug-like possibilities!, and 
a single reaction used to synthesize each molecule has more than 
10’ plausible permutations of catalysts, ligands, additives and other 
parameters’. The merger of a method for high-throughput chemical 
synthesis with a biochemical assay would facilitate the exploration of 
this enormous search space and streamline the hunt for new drugs 
and chemical probes. Miniaturized high-throughput chemical 
synthesis*’ has enabled rapid evaluation of reaction space, but so 
far the merger of such syntheses with bioassays has been achieved 
with only low-density reaction arrays, which analyse only a handful 
of analogues prepared under a single reaction condition® 3. High- 
density chemical synthesis approaches that have been coupled to 
bioassays, including on-bead"*, on-surface!*, on-DNA’® and mass- 
encoding technologies’’, greatly reduce material requirements, 
but they require the covalent linkage of substrates to a potentially 
reactive support, must be performed under high dilution and must 
operate in a mixture format. These reaction attributes limit the 
application of transition-metal catalysts, which are easily poisoned 
by the many functional groups present in a complex mixture, and of 
transformations for which the kinetics require a high concentration 
of reactant. Here we couple high-throughput nanomole-scale 
synthesis with a label-free affinity-selection mass spectrometry 
bioassay. Each reaction is performed at a 0.1-molar concentration in 
a discrete well to enable transition-metal catalysis while consuming 
less than 0.05 milligrams of substrate per reaction. The affinity- 
selection mass spectrometry bioassay is then used to rank the 
affinity of the reaction products to target proteins, removing the 
need for time-intensive reaction purification. This method enables 
the primary synthesis and testing steps that are critical to the 
invention of protein inhibitors to be performed rapidly and with 
minimal consumption of starting materials. 

The merger of nanoscale synthesis with affinity ranking, which we call 
NanoSAR, could enable data-rich interrogations of reaction-chemical- 
bioactivity space (Fig. 1). Nanoscale synthesis consumes less than 
0.05 mg of substrate per reaction’, so precious starting materials such 
as complex drug-like intermediates can be used. Affinity-selection 
mass spectrometry (ASMS) is a sensitive bioassay'*"’ that relates pro- 
tein binding to mass detection of a candidate molecule. ASMS was 
chosen to assay crude nanoscale reaction mixtures directly because 
misleading readouts related to catalysts or other reaction components 
can be omitted as long as the molecular masses of the desired prod- 
ucts and residual reagents can be resolved by high-resolution mass 
spectrometry. However, most existing ASMS protocols provide only 
a binary readout of affinity”. We envisioned that affinity could be 
ranked by titrating down the concentration of protein while main- 
taining a constant concentration of compound. As the protein con- 
centration decreases, competition for binding is induced?!, and only 
compounds with high affinity to the target are observed at the lowest 


protein concentration. With this advance, potent protein inhibitors 
may be identified at a nanomole-scale while forgoing time-consuming 
reaction purification. 

We validated NanoSAR by synthesizing libraries based on three of 
the most popular transformations used in pharmaceutical research”””’. 
Each library contains about 20 compounds and each reaction was run 
on a nanomole scale (less than 0.05 mg). Every analogue was resyn- 
thesized on a traditional scale (about 20 mg) so that bioassay results 
of unpurified nanoscale reactions could be compared directly to data 
obtained from purified products using a conventional approach. The 
crude nanoscale reaction mixtures were analysed by using ultra- 
high-performance liquid chromatography—mass spectrometry (UPLC- 
MS) to evaluate the reaction efficiency, pooled, and submitted to the 
ASMS affinity-ranking assay directly (Extended Data Fig. 1). Next, each 
product was resynthesized on an approximately 20-mg scale using the 
productive reaction conditions identified on the nanoscale and puri- 
fied. These purified products were likewise submitted to the ASMS 
affinity-ranking assay, but were also either submitted to a traditional 
assay of biochemical function or compared to reported binding affini- 
ties. Potent molecules could be identified regardless of the scale of the 
reaction. 

The bioassay results from the crude nanoscale reactions matched 
those obtained from the purified products. For example, a library of 
19 known inhibitors”* of the kinase ERK2 was prepared by allowing 
carboxylic acid 1 to react with diverse amines in the presence of HATU 
(hexafluorophosphate azabenzotriazole tetramethyl uronium) and 
‘Pr,NEt (19 reactions, Fig. 2a) and submitted to affinity ranking. High- 
affinity amide product 2 (inhibitory constant Kj=35 nM), which was 
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Fig. 1 | Nanoscale synthesis and affinity ranking (NanoSAR). 

a, Reaction space and chemical space are navigated at the nanomole-scale 
and bioassayed directly to locate potent protein inhibitors and productive 
reaction conditions simultaneously. b, Nanoscale synthesis consumes 

a fraction of the material that is required for traditional synthesis. 

Crude reactions are assayed by ASMS, with affinity ranking achieved by 
decreasing the concentration of the target protein to induce competition 
among compounds. 
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Fig. 2 | NanoSAR identification of ERK2, MK2 and CHK1 inhibitors. 
Inhibitors were produced using nanoscale synthesis and submitted directly 
to an ASMS affinity-ranking assay, then resynthesized on a 50-j1mol 

scale, purified, and reassayed by ASMS for comparison to the functional 
bioactivity (ICs9) or reported binding affinity (Kj) of each compound. 

a, A library of 19 ERK2 inhibitors prepared by amide coupling. b, A library 
of 18 MK2 inhibitors prepared by Suzuki coupling. c, A library of 20 CHK1 
inhibitors prepared by Suzuki and C-N coupling. In each library, every 
compound has a unique identifier (ID) and was formed under one or 
more reaction conditions, with the yield, determined by UPLC, for each 
reaction depicted in the heat map and the affinity-ranking data depicted 
in the scatter plot. One high-affinity compound (2, 5 and 8) and one 
low-affinity compound (3, 6 and 9) from each library are shown. The 
reaction conditions identified on the nanoscale (red box) were then used 
for the 50-jsmol-scale resynthesis of each exemplary compound, and the 
corresponding affinity-ranking data for that compound are highlighted 
(red circle). NB indicates no binding was observed in the affinity-ranking 
assay at the highest protein concentration tested. See Extended Data 

Figs. 2-4 for reaction conditions (A-I), structures, yields and K; or ICso 
values of all compounds. 


observed at the lowest ERK2 concentration tested in the affinity-ranking 
assay, was easily distinguished from low-affinity amide product 3 
(Kj > 5,000 nM), which was not observed at any of the ERK2 concen- 
trations tested. Therefore, with NanoSAR it was possible to differentiate 
potent from impotent analogues with less than 0.05 mg of material and 
with no reaction purification (Extended Data Fig. 2). 

For this library of ERK2 inhibitors produced by amide coupling, a 
single reaction condition was sufficient. For subsequent transition- 
metal-catalysed Suzuki or C-N coupling reactions, we surveyed 
eight and nine reaction conditions, respectively, to ensure that every 
product was synthesized successfully. Nanoscale Suzuki coupling 
of heterocyclic bromide 4 with 18 diverse boronates was achieved 
with 8 combinations of Buchwald precatalysts”, bases and solvents 
(144 reactions, Fig. 2b). Using conditions identified at the nanos- 
cale, all 18 products were prepared on a traditional 50-mol scale, 
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purified and assayed for inhibition of the function of the kinase 
MK2”*. NanoSAR can readily distinguish potent inhibitors, such 
as 5 (half-maximal inhibitory concentration ICs) =59 nM), from 
impotent inhibitors, such as 6 (ICs 3,400 nM). ICso values for all 
18 analogues were determined, using purified compounds produced 
on a 50-mol scale, and could be predicted from the NanoSAR data 
(Extended Data Fig. 3). Similarly, bromide 7 was used to generate 20 
distinct inhibitors”’ of the kinase CHK1 via C-C and C-N coupling 
(170 reactions, Fig. 2c). Potent inhibitors, such as 8 (IC59 23 nM), were 
readily distinguished from an array of less-potent, lower-affinity com- 
pounds, such as 9 (IC59 > 10,000 nM). Once again, affinity-ranking 
results from crude nanoscale reactions predicted the CHK1 func- 
tional activity of purified samples produced on a micromole scale 
(Extended Data Fig. 4). Samples treated with a metal scavenging resin 
had low levels of residual palladium; however, simple dilution with 
water reduced palladium levels to less than 5 p.p.m. (Extended Data 
Fig. 5). 

The information density produced by NanoSAR permits explora- 
tions that were previously impractical or impossible, such as the rapid 
survey of reaction conditions, building blocks and protein affinity 
(Fig. 3). For example, 7 was coupled to various functionalized nucleo- 
philes via multiple distinct chemical transformations, with affinity 
information for CHK1 generated for each analogue of the form of 10 
produced. As shown in the outer reaction-condition heat map in Fig. 3, 
we explored combinations of catalysts (11-26), bases (27-34) and sol- 
vents (N-methyl pyrrolidinone (NMP), dimethylsulfoxide (DMSO) and 
N,N-dimethylformamide (DMF)) with 17 representative nucleophiles 
(D1-D17, Extended Data Fig. 6). For each class of nucleophile, four of 
the most productive reaction conditions were selected so that 7 could 
be effectively coupled to 384 diverse building blocks, as shown in the 
inner heat map of building-block reactivity (Fig. 3). By using four reac- 
tion conditions, we identified productive C-N, C-O, C-S or C-C cou- 
pling conditions for 345 of 384 products (90%) (Extended Data Figs. 7, 
10). The 345 crude reaction mixtures, outlined in black in the inner 
heat map, were submitted directly to affinity ranking by titrating the 
CHK1 concentration from 101M to 0.2 1M: products that bind tightly 
to CHK] appear closer to the centre of the circle in the affinity-ranking 
plot (Fig. 3). Key compounds were identified for scale-up, purification 
and analysis in an assay of CHK1 functional activity (Extended Data 
Fig. 8). Exemplars (35-41) are shown in Fig. 3 (see Extended Data Fig. 9 
for dose titration curves), with each class of functional group revealing 
unique structure-reactivity and structure-bioactivity relationships. 
Compounds 38, 39 and 41 were isolated in 98%, 52% and 24% yield, 
respectively, from reactions run on a 50-.mol scale, and were potent 
CHK1 inhibitors, as predicted by NanoSAR. 

Subtle reactivity trends were illuminated in the data analysis. For 
instance, one of the thiol nucleophiles that we tested in the initial survey 
of reaction conditions (3-phenylpropane-1-thiol, D15) couples to 7 in 
high yield regardless of the catalyst, base or solvent used (see Extended 
Data Fig. 6 for reaction-condition mapping). This finding suggests that 
alkyl thiols couple to 7 via a nucleophilic aromatic substitution (SyAr) 
mechanism rather than via palladium catalysis. Guided by this infor- 
mation, two of the four reaction conditions selected to couple thiols 
to 7 used no catalyst. Compound 35 was isolated in 98% yield on a 
50-\1mol scale using no catalyst, with P,Et in DMF; however, it was 
inactive against CHK1, as predicted by NanoSAR. 

Higher-order trends also emerge from the data. For example, com- 
paring the reaction performance of boronate to amine building blocks 
suggests that, at least with 7, the Suzuki coupling is a more robust 
reaction than is the C-N coupling. Years of anecdotal experience 
with these two transformations led us to hypothesize this disparity 
in reaction robustness’, and here the phenomenon can be readily 
visualized. Patterns of bioactivity are also apparent in the central 
plot of affinity ranking in Fig. 3. For instance, many of the boronate-, 
amide- and amine-derived products were bioactive, exhibiting affi- 
nity to CHK1 and appearing as points closer to the centre of the circle; 
these compounds exhibited functional inhibition of CHK1 function 
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Fig. 4 | NanoSAR enhances coverage of chemical space. a, When coupling 
7 to diverse nucleophiles (D18-D402), a single reaction condition (11, 27, 
NMP, room temperature) produced 158 (yellow circles) out of 384 (41%; 
see inset pie chart) designed products, with 226 failures (0% yield; purple 
crosses). The purple highlighted regions indicate ‘dark regions of principal 
component (PC) chemical space, where reactions failed. b, Matching each 
nucleophile to the best of four reaction conditions produced 345 out of 384 
(90%) products. e, 21 compounds from the ‘dark’ space that were made 
accessible by NanoSAR bound to CHK1, such as 42-45. CHK1 ICsp values 
are from a functional assay of purified compounds. 


(Extended Data Fig. 9). By contrast, few of the sulfonamide-, thiol-, 
alkyne- or alcohol-derived products bound to CHK1, and these link- 
ages could reasonably be deprioritized from future design cycles. In 
total, only 123 mg of 7 were consumed to run 3,114 reactions, to match 
345 of the 384 diverse nucleophiles with coupling conditions, to assay 
the products and to paint a landscape of reactivity and bioactivity. 

For comparison, coupling of bromide 7 to the same 384 diverse 
nucleophiles under a single reaction condition—the recently reported 
combination of ‘BuXPhos Pd G3 (11)*° with PEt (27) as a base”®, which 
is among the most substrate-tolerant palladium-catalysed coupling 
conditions known’—yielded detectable product (10) for only 158 of 
384 reactions (41%). Therefore, screening reaction conditions by using 
nanoscale synthesis enabled deeper exploration of chemical space, 
with an additional 187 targeted compounds synthesized and assayed 
successfully (Fig. 4). Under a single reaction condition, synthesis was 
unsuccessful for large swaths of chemical space and for compounds on 
the fringes of the space that we studied (Fig. 4a). NanoSAR identified 
21 compounds from this ‘dark space that bound to CHK1 (Extended 
Data Fig. 10). Exemplary compounds were resynthesized at a 50-1mol 
scale, purified and tested in a functional assay, whereupon several prod- 
ucts (such as 42-45) displayed functional inhibition of CHK1 at low, 
nanomolar concentrations (Fig. 4c). As can be seen from Fig. 3, a large 
number of reactions produce no observable product and a large num- 
ber of compounds have no affinity to the protein target, which, in our 
experience, is typical of drug discovery. Machine-learning algorithms 
may one day facilitate the navigation of these problems”; however, 
locating bioactive molecules and conditions for their synthesis for now 
remains an experimental science, which NanoSAR is well positioned 
to accelerate. 


Data availability 

The data that support the findings of this study are available within the paper 
and its Supplementary Information. Source Data for Figs. 2-4 and Extended Data 
Figs. 2-6, 8-10 are available with the online version of the paper. 
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and affinity ranking workflow. a, Reaction solutions are dosed using a 
TTP mosquito liquid-handling robot that handles increments of 20 nl. 
b, c, UPLC-MS analysis confirms the presence of products from crude 
reactions (b), which is visualized in heat maps (c). d, For the ASMS assay, 
productive reactions are mass-encoded, pooled and incubated with a 
protein of interest. e, f, Elution through a size-exclusion column separates 
protein-bound compounds from unbound compounds (e), and binders 


UPLC-MS Analysis 
1-2 minutes/sample 


Separate non-binders 
by gel filtration 


Repeat interesting reactions on ~20 mg scale 
and submit purified product to traditional assay 


LETTER 


High Information Density 
Up to 1536 reactions/experiment 


protein- 
binder complex 


non-binders 


Denature protein-binder complex & 
observe small molecule by MS 


1 
> 
a 


ase 
Bp 
=> T 


K, or IC, 
(purified product, traditional scale) 


oi 


© 
S 


lowest protein concentration binding is 
observed at (crude nanoscale) 


oo 


| 
T T T T 1 


can be observed by high-performance liquid chromatography—mass 
spectrometry (HPLC-MS) following denaturation of the protein-ligand 
complex (f). g, Decreasing the concentration of protein in the ASMS assay 
increases the competition for binding so that ligands can be categorized as 
high, medium or low affinity. h, Scale-up of compounds and purification 
by using a traditional approach. i, Measurement of ICs in a functional 
assay enables comparison of the new method to existing assay technology. 
See Supplementary Information for additional details. 
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Extended Data Fig. 2 | Structures, yields, K, values and ASMS 
comparison plots for the ERK2 library. a, A library of 19 unique ERK2 
inhibitors (2, 3, Al-A17) was produced by coupling 1 to diverse amines 
under a single reaction condition (HATU, ‘Pr,NEt, NMP; see Fig. 2a). 
b, Crude nanoscale reactions were submitted to the ASMS assay, with 
affinity ranking determined by reducing the concentration of ERK2 to 
increase the competition for binding. The lowest ERK2 concentration 

at which the mass signal of the compound was still observed is shown 
on the abscissa. The Kj value reported”! from purified product samples 
generated on a 500-times-larger reaction scale is shown on the ordinate. 
The colour shading groups compounds into low affinity (purple), modest 


affinity (magenta) or high affinity (yellow), as determined by the ASMS 
assay. rxn, reaction. c, Comparison of the results of the ASMS affinity- 
ranking assay (abscissa) to the reported ERK2 K; values (ordinate). Both 
datasets are from purified product samples produced on a 50-j1mol scale. 
d, Comparison of results of ASMS affinity ranking from crude reaction 
mixtures generated on a 100-nmol scale with those from purified product 
samples generated on a 50-|:mol scale. Points are coloured by Kj, and 
jittering was applied to this categorical data to reveal overlapping data. 

e, Product structures with reaction conditions used, isolated yields and K; 
values. See Supplementary Information for additional details. 
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submitted to the affinity-ranking assay following treatment with SiliaMetS 
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for additional details. 
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; (D3 is compound C5 in Fig. 2c; 4p is compound C6 in 


Fig. 2c; ‘D5 is compound 8 in Fig. 2c; ‘prepared from the boronic acid. 


D1 is compound C2 in Fig. 2c; D2 is compound C3 in Fig. 2c 
See Supplementary Information for additional details. 


from this screen were selected for use in the subsequent library synthesis 


campaign. 
and 43 in Fig. 4 


(11-26) and bases (27-34) in NMP, DMSO or DMF solvent. b, The details 
beside the heat maps show the mapping of nucleophiles, solvents, catalysts 


and analysed by UPLC-MS for conversion to products of the form of 10 


nucleophiles (D1-D17) were screened against combinations of catalysts 
and bases. NA, no catalyst used. Reactions were run on a 100-nmol scale 
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322 additional nucleophiles were used generating 285 additional products. 


See Supporting Information. 


Extended Data Fig. 7 | Exemplary compounds from the synthesis of 
the library. See Fig. 3. Structures for 62 diverse coupling products (of 

the form of 10) are shown, which were selected from the 345 productive 
reactions identified in a library synthesis campaign targeting 384 products 


(Fig. 3). Diverse nucleophiles were coupled to 7 using the four reaction 
conditions selected in Fig. 3 and Extended Data Fig. 6. See Supplementary 
Information for additional details. 
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by reducing the concentration of CHK1 to increase the competition for 
binding. The axes of the plots are as in Extended Data Fig. 4b, c. Points 
are coloured by the ICs» value for inhibition of CHK1 function (as in 

Extended Data Fig. 4d). See Supplementary Information for additional 


Extended Data Fig. 8 | Comparison of the results of affinity ranking 
with IC; values for inhibition of CHK1 functional activity. The 62 
exemplary compounds (Extended Data Fig. 6) were selected from 345 that 
were submitted to affinity ranking (Fig. 3). Crude nanoscale reactions 
were submitted to the ASMS assay, with affinity ranking determined details. 
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used. The dose response curves and ICs9 values shown were measured 
on purified product samples generated on a 50-|1mol scale and could be 
predicted from affinity-ranking results of crude nanoscale reactions as 
shown in Extended Data Fig. 8. See Supplementary Information for dose 
response curves of additional compounds. 


Extended Data Fig. 9 | Dose response curves for exemplary compounds. 
The compounds shown are 35-41 and 43. The inhibition of CHK1 
functional activity by diverse coupling products (10) from the reaction 

of 7 with nucleophiles under various reaction conditions is shown. 
Isolated yields were achieved when the reaction conditions shown were 
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Extended Data Fig. 10 | Reaction metrics and chemical ‘dark space. 

a, Histogram of reaction performance for 384 diverse coupling reactions 
with bromide 7 using a single reaction condition (10 mol% ‘BuXPhos 

Pd G3, P2Et, NMP). Only 158 of the 384 targeted products were 
observed by mass spectrometry and the majority of the reactions failed 
(0% conversion to product). b, Histogram of reaction performance for 
384 diverse coupling reactions with bromide 7 using the best of four 
reaction conditions, as described in Extended Data Figs. 6 and 7. Of the 
384 targeted products, 345 were observed by mass spectrometry, with a 
more even distribution of reaction yields and the majority of reactions 
succeeding (100% conversion to product). c, Principal-component (PC) 
analysis of chemical ‘dark’ space, with each dot representing a compound 
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that was not formed under a single reaction condition (0% yield) but that 
had affinity to CHK1. By using the best of four reaction conditions, 187 
additional products were produced and assayed that bound to CHK1; 
the colour of the dots reflects the affinity ranking of the compounds. 

The boundaries of this space are identical to those depicted in Fig. 4a, in 
which purple shading highlights additional regions where the majority of 
reactions failed (0% yield). Areas where dots are shown in purple shaded 
regions depict products with affinity to CHK1 that were formed in 0% 
yield in a but in more than 1% yield in b. d, Potent CHK] inhibitors that 
were produced in 0% yield under a single condition (10 mol% ‘BuXPhos 
Pd G3, P2Et, NMP), but in modest to good yields following reaction- 
condition screening. 
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Over 60 years ago, stone tools and remains of megafauna were 
discovered on the Southeast Asian islands of Flores, Sulawesi and 
Luzon, and a Middle Pleistocene colonization by Homo erectus was 
initially proposed to have occurred on these islands'~*. However, 
until the discovery of Homo floresiensis in 2003, claims of the 
presence of archaic hominins on Wallacean islands were hypothetical 
owing to the absence of in situ fossils and/or stone artefacts that 
were excavated from well-documented stratigraphic contexts, 
or because secure numerical dating methods of these sites were 
lacking. As a consequence, these claims were generally treated with 
scepticism®. Here we describe the results of recent excavations at 
Kalinga in the Cagayan Valley of northern Luzon in the Philippines 
that have yielded 57 stone tools associated with an almost-complete 
disarticulated skeleton of Rhinoceros philippinensis, which shows 
clear signs of butchery, together with other fossil fauna remains 
attributed to stegodon, Philippine brown deer, freshwater turtle 
and monitor lizard. All finds originate from a clay-rich bone bed 
that was dated to between 777 and 631 thousand years ago using 
electron-spin resonance methods that were applied to tooth enamel 
and fluvial quartz. This evidence pushes back the proven period of 
colonization’ of the Philippines by hundreds of thousands of years, 
and furthermore suggests that early overseas dispersal in Island 
South East Asia by premodern hominins took place several times 
during the Early and Middle Pleistocene stages!~*. The Philippines 
therefore may have had a central role in southward movements into 
Wallacea, not only of Pleistocene megafauna’, but also of archaic 
hominins. 

The most recent recoveries in Flores®*? and Sulawesi!° (Indonesia) 
provide a unique documentation of overseas hominin dispersal during 
the early Middle Pleistocene epoch. An early presence in the Philippine 
archipelago has been hypothesized since the 1950s, with the reporting 
of presumably Pleistocene megafaunal remains and ‘Palaeolithic’ indus- 
tries consisting of chopping tools and flakes (the ‘Cabalwanian’ and 
‘Liwaniar’ industries, respectively) from surface finds and excavations 
in the Cagayan Valley basin of northern Luzon*. Despite the fact that 
these early discoveries took place more than 60 years ago, no direct 
association between megafauna and lithic industries has been docu- 
mented since then, and no secure numerical dating of both fossil fauna 
and lithics has been available for this region'!. To date, the discovery of 
a human metatarsal in Callao Cave in northern Luzon®, directly dated 
to 66.7 + 1.0 thousand years ago (ka), represented the oldest evidence 
of the peopling of the Philippines. 


In 2013, a survey of the Cagayan Valley near the Rizal Municipality 
(Kalinga Province) led to the discovery of a concentration of vertebrate 
bones and stone artefacts scattered on the surface near what became 
our new excavation site. The Kalinga site (17° 33’ 45.0318” N, 121° 33’ 
35.7372" E) (Fig. 1b) has been excavated annually since 2014 and has 
resulted in the discovery of in situ megafauna and associated stone 
artefacts. The substrate consists of the upper part of the Awidon Mesa 
Formation, a 400-m thick sequence of alluvial stream deposits (mainly 
sandstones and claystones) intercalated with volcaniclastic and pyro- 
clastic layers (Fig. 1a). These sediments were deposited on an alluvial 
fan system in braided streams of the paleo-Chico River as a conse- 
quence of uplift in the Central Cordillera to the west!*!°. During a 
poorly constrained Pleistocene phase of folding in response to east- 
west compression, alluvial fan deposition in the Kalinga area came to 
a halt. 

We conducted the main 16-m? excavation at the head of a modern, 
dry stream valley, north of a small hill and down to a maximum depth 
of 2m (Fig. 1c and Supplementary Information). A 25 x 1-m? slot 
trench was excavated down the hill to the main excavation. Together, 
these excavations revealed a total of 7.5 m of stratigraphy comprising 
four main sedimentary units, in ascending order: unit A, unit F, unit G 
and unit J (Fig. 1d, e and Extended Data Fig. 1). An almost-complete 
disarticulated skeleton of R. philippinensis (Extended Data Fig. 2) was 
found embedded in the basal sediments of unit F lying across the base 
of an erosional channel surface that cuts down vertically into sandy unit 
A. This channel was filled with an up to 3.25-m thick mudflow (unit F; 
see Extended Data Fig. 3, 4), which covered the bones, along with an in 
situ tektite as well as 57 stone tools and sparse fossils of other animals 
(Geoemydidae, Varanus cf. salvator, Stegodon cf. luzonensis and Cervus 
cf. mariannus) (see Supplementary Information). The archaeological 
layer (unit F) is conformably overlaid by an approximately 1.15-m 
thick, sterile, cross-bedded coarse sandy fluvial unit with silty lenses 
(unit G), which is in turn conformably overlaid by unit H, a 2.5-m thick 
silty pedogenized layer with rhyzoliths. 

The 57 stone artefacts account for six cores, 49 flakes and two 
possible hammer stones that all originated from unit F (Fig. 2 and 
Supplementary Information). With the exception of the two possible 
hammer stones (Fig. 2b), all artefacts lack a patinated lustre and have 
a fresh appearance, indicating that any transport was minimal. The 
knapping strategies were oriented towards short and unorganized 
core reduction, resulting in non-standardized flake morphologies 
and dimensions, and all artefacts lacked any intentional retouch. 
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Fig. 1 | Geology and sedimentology of the Kalinga Excavation site. 

a, Digital elevation map of the Cagayan Valley surrounding the Rizal 
municipality (located in b, Northern Luzon Island, east of Huxley’s and 
north of the Wallace Lines). The Kalinga site (red star) is located at the 
southern tip of the weakly folded Cabalwan Anticline. Geological units 
of the area bounded by the Cagayan River on the east and the Chico 
River on the west are after Mathisen'?. Stratigraphically, the site layers 
pertain to the upper part of the Awidon Mesa Formation, a Pleistocene 
sequence of alluvial stream deposits intercalated with volcaniclastic and 
pyroclastic deposits. The depositional environment of the Awidon Mesa 
Formation was characterized by braided rivers on an alluvial fan system 
that formed in response to uplift in the Cordillera Central to the West!””?. 
c, Contour map of the main excavation and the adjoining trench H along 
the small valley where the Kalinga site is located. d, Detailed stratigraphy 


The Kalinga lithic assemblage is diverse in its techniques, technology 
and final products, and appears similar to the chert industry described 
at the Arubo | site’* (see Supplementary Information). Also recovered 
from the unit F excavation area was a 600-g pebble among hundreds 
of pebbles that were all lighter than 200 g, and which we interpret as a 
possible manuport. 

Among the more than 400 bones recovered from unit F, the 
most striking remains were of a disarticulated, approximately 
75% complete skeleton of a single R. philippinensis individual 
(Fig. 3 and Extended Data Fig. 2). The bones were found lying 
on top of the erosional surface down-cutting unit A, and were 
embedded in the basal clay-rich sediments of unit F along the 
deepest part of the paleo-channel bed (Extended Data Fig. 3). 
Although none of the rhinoceros bones were found articu- 
lated, the recovered skeletal elements occur within a 3 x 2-m? 
area, suggesting that disarticulation occurred sub-aerially and 
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of the excavation with the absolute ages of the sedimentary units. Unit A 
constitutes a fining upward complex of sandy to silty cross-bedded fluvial 
sediments. The top of unit A is eroded and cuts down vertically over 

at least 2.5m. This erosive channel is filled with unit F, a poorly sorted 
mudflow deposit with a maximum thickness of 3.25 m. The rhinoceros 
skeletal elements and most of the stone artefacts were found lying directly 
above the erosional contact, and were found embedded in the clay-rich 
mud of unit F. Unit F is conformably overlaid by a sequence of horizontally 
layered coarse sandy to silty layers (unit G), which is in turn conformably 
overlaid by a thick sequence of silty deposits overprinted by palaeosols 
(unit H). e, Southward view of trench H showing the lower and upper 
contacts between mudflow unit F and sandy unit G and between sandy 
unit A and mudflow unit F. 


that transport prior or during deposition of mudflow unit F was 
minimal. 

Thirteen of the excavated rhinoceros bones, all of which in life had a 
thin cover of soft tissue (that is, the ribs and metacarpals)'*"°, display 
cut marks (Fig. 3 and Extended Data Fig. 2). Both rhinoceros humeri 
have similar percussion marks on the anterior surface for the right 
humerus and on the posterior surface for the left humerus, and both 
were presumably made with the intention to smash the bones and gain 
access to the marrow’. This percussion action resulted in the break- 
age of the left humerus into five pieces, which is the only bone found 
fragmented; however, the fragments were still clustered together within 
a small 1-m? area of the excavation. On the right humerus, however, 
percussion did not result in the fragmentation of the bone (Fig. 3). 

To constrain the age of the bone bed and the stone artefacts it con- 
tained, we applied three different dating methods to various materials 
(Fig. 1). Single crystal #°Ar/*’Ar dating was applied to plagioclase 
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Fig. 2 | Lithic artefacts from Kalinga. a, Cortical flake on oe 
(II-2014-J1-362; length (L) = 100 mm, breadth (B) = 55 mm, thickness 
(T) =33 mm). b, Possible hammerstone on dacite (II-2014-J1-371), 
although its highly eroded aspect precludes any definitive conclusion. 
Arrows indicate crushed areas interpreted as the result of precussions. 

c, Siret kombewa flake on jasper (II-2014-J1-391; L=40mm, B=18mm, 


crystals from the sandy units directly below and above the archaeological 
unit F and yielded two statistically undistinguishable weighted mean 
dates of 1,050 + 28 ka and 1,007 +29 ka, respectively (1a confidence 
interval; Supplementary Information and Extended Data Fig. 5). These 
“°Ar/*?Ar dates yielded an age for the formation of the volcanic plagio- 
clase crystals. Quartz grains from the same two sandy units were also 
dated using the electron spin resonance (ESR) method", and resulted in 
a maximum depositional date of 727 + 30 ka for unit A, anda minimum 
depositional date of 701 +70 ka for unit G (1o confidence interval; 
see Supplementary Information). 

To directly constrain the age of the rhinoceros skeleton and the cut 
marks, we applied ESR/uranium-series dating to the enamel of the 
rhinoceros’s right maxillary third premolar from the unit F bone bed. 
The tooth yielded an age of 709 + 68 thousand years (1o confidence 
interval), which is in agreement with the ESR results on the quartz 
(Fig. 1, Extended Data Fig. 6, Extended Data Table 1, Supplementary 
Information and Supplementary Table 1). In addition, a palaeomagnetic 
sample was taken from a laminated silty lens in the lower part of unit G 
and was found to have a normal magnetic polarity (see Supplementary 
Information: and Extended Data Fig. 7). The presence in unit F of a 
reworked Australasian tektite (see Supplementary Information and 
Extended Data Fig. 8) that had formed during a major meteoritic 
impact just before the onset of the Brunhes Normal polarity epoch 
at 781 ka!®°, also provides further support for these closely grouped 
dating results. These results further suggest that the volcanic plagioclase 
crystals from unit G on which the “°Ar/**Ar date was obtained were 
reworked from older volcanoclastic deposits, and therefore provide 
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T=8 mm) that has a longitudinal and oblique fracture on the inferior 
two-thirds of the left side resulting from a knapping accident while flaking. 
d, Double-backed flake on flint (II-2014-J1-519). e, Core on quartz 
(II-2014-J1-396), with clear marks of knapping on an anvil, and its 
diachritic diagram. Arrows indicate the percusion axes. 


a maximum age for the sequence (see Supplementary Information). 
Taken together with the ESR dating results, it follows that the rhinoc- 
eros skeleton was buried by a mudflow at least 631 ka. 

Our excavations at Kalinga and the numeric dating results clearly 
provide securely dated evidence for human colonization of the 
Philippines by the early Middle Pleistocene epoch, and long before 
the appearance of modern humans in both the local context and wider 
Island South East Asia region?!. Although the identity of these archaic 
toolmakers remains unknown, it is likely that they dispersed over at 
least one sea barrier to reach Luzon Island”. The most likely points 
of origin are Borneo through Palawan to the west, or China through 
Taiwan to the north, this latter island was connected to mainland Asia 
during periods with low sea levels’. The Middle Pleistocene fauna from 
the Awidon Mesa Formation contains a wider range of vertebrates than 
the Pleistocene faunas from two islands to the south of the Philippines 
that have both yielded evidence of the occupation by premodern 
humans, Sulawesi’ and Flores”* (Extended Data Fig. 9). Overseas dis- 
persal throughout Wallacea of land mammals, including hominins, 
could have been primarily, although not exclusively, in a north to south 
direction, following the major surface current flow patterns. 

Beyond the chronological gap that is yet to be filled, a question clearly 
linked to our discovery is the origin of the Callao Cave hominin that 
has been dated to 66.7 +1 ka. This diminutive Callao hominin may 
represent a direct descendent from a Pleistocene migration stock 
related to these early Kalinga toolmakers—similar to what happened 
on Flores Island—or may be derived from a more recent migration 
wave of anatomically modern humans®*!?*9, 
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Fig. 3 | Different types of marks at the surface of the bones. a, Left 
humerus (II-2014-J1-368) found broken into five fragments in the 
excavation with an anthropogenic conchoidal percussion mark (P) on its 
anterior surface most likely produced to get access to the marrow. 

b, The sub-complete rib (II-2014-J1-475) has a diagnostic anthropogenic 
cutmark with a V-shaped cross-section (V), hertzian cones (H), 
asymmetrical profile (A) and shoulder effect (S) on its lateral surface 
resulting from defleshing. Black stains are also present inside the cutmark, 
which resemble the ones observed on the surface of the rib and are the 
result of taphonomic processes that occurred after this cutmark was 
made (see page 156 of Fernandez-Jalvo and Andrews’’). c, Rib fragment 
(II-2014-J1-403) with a shiny surface on its lateral face resulting from 
multiple multidirectional striations, which are presumably caused by 


Despite the current evidence, it still seems too farfetched to suggest 
that H. erectus, or another unknown Pleistocene ancestral candidate for 
the Kalinga toolmakers (for example, Denisovans”’), were able to con- 
struct some sort of simple watercraft and deliberately cross sea barriers 
to reach these islands”®. However, considering evidence of overseas dis- 
persal during the Middle Pleistocene stage is increasing in number??””, 
such a hypothesis cannot currently be rejected. 


Data availability 
The datasets generated and/or analysed during the current study are available from 
the corresponding authors upon reasonable request. 


Online content 
Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
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trampling. d, Right metacarpal IV (II-2014-J1-282) with parallel and 
rectilinear anthropogenic cut marks (R) on its medial surface presumably 
generated during disarticulation. e, Right humerus (II-2014-J1-289) with an 
anthropogenic conchoidal percussion mark (P) similar in size and shape to 
the percussion mark on the left humerus, but located on its posterior surface 
and more distally and associated with a small adhered bone flake (F) (see 
page 298 of Fernandez-Jalvo and Andrews'’). f, Three-dimensional surface 
topography of another rib (II-2014-J1-466) showing a linear mark (on the 
left) with V-shaped cross-section (V) of anthropogenic origin, as well as 
hertzian cones (H) and a linear mark (on the right) with a base as broad as 
the heights of the walls of the grove, commonly attributed to trampling but 
also with assymetrical walls and possible microstriations in the bottom (M) 
of the groove, commonly attributed to anthropogenic marks. 


are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0072-8. 
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Extended Data Figure 1 | Geology and sedimentology of the Kalinga 
Excavation site. a, Detailed stratigraphic drawing of trench H also 
showing the east wall of the S quadrant of the main excavation. The 
sedimentary patterns are the same as in Fig. 1. Representative logarithmic 
grain-size diagrams are shown for samples from each sedimentary unit. 
b, Detailed stratigraphic drawing of the main excavation walls. 

c, Overview towards the northwest of the quadrants N and NW of the 
main excavation in 2015. The concentration of faunal remains and stone 
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tools lying at the base of unit F is exposed, just above the eastward sloping 
erosional contact with unit A. d, Detailed view of quadrant NW showing 
the position of a flake lying next to the rhinoceros left femur. e, Detail 

of quadrant N showing the piece of waterlogged wood fragment (yellow 
outline) recovered near a rhinoceros rib extremity (blue outline). f, Detail 
of quadrant NE showing the tektite recovered in unit F along with the 
faunal and lithic remains. 
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Extended Data Figure 2 | Faunal remains from Kalinga archaeological skeleton. We estimate that about 75% of the skeleton has been recovered. 
unit F. a, Drawing showing the preservation of the rhinoceros and position _ b, Fibula of Varanus salvator. c, Radius of a Cervidae. d, Molar of Cervus 
of the taphonomical marks. A total of 97 fragments of ribs of all sizes have —_cf. mariannus. e, Molar fragment of Stegodon cf. luzonensis. 

been recovered and not all of them could be clearly positioned on the 
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Extended Data Figure 4 | X-ray diffraction pattern of powdered unit 

F clays. Quartz corresponds to the bipyramidal quartz crystals, some of 
which are visible to the naked eye. Bipyramidal quartz, albite, hornblende, 
nontronite and saponite all have a volcanic origin. Albite is a plagioclase 
feldspar frequent in pegmatites. Hornblende is a common silicate mineral 
in igneous and metamorphic rocks. Nontronite is an iron-rich smectite 


type of clay, which can be produced by hydrothermal alteration. Similarly, 
the smectite mineral saponite results from alteration of volcanic glass. The 
mineral composition of unit F supports the interpretation as a mudflow set 
in a volcanic environment. CPS, counts per second; Cu K-a corresponds 
to the wavelength. 
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Extended Data Figure 6 | Measurements of dose rates and calculation of 
equivalent dose to compute the ESR ages from quartz crystals and tooth 
enamel. a, Al and Ti centre ESR spectra of natural and bleached aliquots 
for unit A quartz showing that the Al signal is not completely reset, 
although it is not measurable because it is extremely weak and concealed 
by noise. b, ESR dose-response curve obtained for the rhinoceros tooth 
(archaeological number: II-2014-J1-095; sample code: CGY1501). The 
equivalent dose (De) was extrapolated using a single saturating exponential 


31. Duval, M. & Grin, R. Are published ESR dose assessments on fossil tooth 
enamel reliable? Quat. Geochronol. 31, 19-27 (2016). 
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function (Origin Microcal software) following the recommendations of 
Duval and Griin*! (D, < 500 Gy, therefore Dinax < 5,000 Gy for this sample). 
c-f, ESR dose-response curves, for the aluminium (Al) and titanium- 
lithium (Ti-Li) centres of layers G1 and A4. b-f, The vertical bars indicate 
the standard deviation around the mean for each measurement. The 

red curve is the dose-response curve. The blue curves indicate the 95% 
confidence interval of the dose-response curve. 
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Extended Data Figure 7 | Progressive demagnetization curves, 

equal area projection stereoplots and Zijderveld diagrams for the 

six analysed specimens from Layer G). a-f, Each panel shows the 
progressive demagnetization curves (bottom left), equal area projection 
stereoplots (top left) and Zijderveld diagrams (right). Open circles in 

the Zijderveld diagrams represent the inclination whereas closed circles 
represent declination. Open and closed circles in the equal area projection 
(stereoplot) represent the upper and lower hemisphere, respectively. 


a-e, Specimens were treated by alternating field demagnetization. 

f, Specimen HT-1E was treated by thermal demagnetization. 

g, h, Equal area projections of the mean chemical remanent magnetization 
(ChRM) directions of all analysed specimens before (g) and after (h) 
demagnetization. Solid squares represent the upper hemisphere. The black 
cross indicates the mean ChRM direction of the six specimens combined, 
surrounded by the «95 circle. The red cross represents the present-day 
magnetic direction. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


b g 
= 
i= 
2 
m 
ro} 
a 
= 
° 
oO 
2 
z 
2 
o 2 
3) 8 
2 g 
= a 
5 < 
= 
10 
Kalinga glass composition (wt%) 
2 3 4 5 6 7 8 9 10 11 
Energy (keV) 
Extended Data Figure 8 | Analysis of the Kalinga tektite recovered from _ glass composition through ,.XRF to an australasite tektite from China 
the archaeological layer unit F. a, Picture of the tektite. b, Micro-X-ray measured in the same conditions (red squares) and an average australasite 
fluorescence (XRF) spectra (un-indexed peaks correspond to the Rh composition (blue diamonds)*. 


source) showing its composition. c, Comparison of the Kalinga tektite 
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Extended Data Figure 9 | Fauna diversity of the Kalinga site compared 
to contemporaneous faunas from other islands in the region. 
Contemporaneous to those faunas are ‘classic’ H. erectus faunas on 
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the putative ancestor of H. floresiensis**, and the Walanae Fauna on the 
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Extended Data Table 1 | ESR/U-series results for the rhinoceros tooth 


a 
U 
Sample Tissue 
(ppm) 
dentine 
CGY1501 0.1692 
(R-2014-J1- efiainal 0.2356 + 
095) 0.0001 
b 
Palaeodose 
Sample Tissue De 
(Gy) 
IPH-2015-09 
dentine 
CGY1501 486.50 + 
(R-2014-J1- siainell 23.06 
095) 


Apparent U- 
BY 238Y 2307 R?22Th 2307 R/234U 20Ry/30T, Series ages! 
(ka+2o) 
IPH-2015-09 52.3336 + 1.3701 + 0.8950 + 
0.0062 141.40 0.0051 1.000 220.70 + 3.28 
1.2564 + 0.8147 + 
0.0009 48.96 0.0014 0.976 164.86 + 0.96 
U uptake Daa Dap? Dapi Dap Dai Diz Da BSkiU series 
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parameter Internal ! denttsed dentine sediment Y cosmic total us‘ 
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a, U-series data for the tooth CGY1501. b, ESR/U-series data and age. Eventual radium and radon losses from the dental tissues were estimated from cross-checked + and a data’. Dose conversion 
factors were used according to previously published data®®. Water contents of O+0%, 7+5% and 10+5% were used for enamel (fixed), dentine (fixed) and sediments (difference in mass between the 
natural sample and the same sample dried for a week in an oven at 50 °C), respectively. 


1A k-value of 0.13 +0.02 was used following a previ 


ously published study?8. 


?The enamel thickness removed during the preparation process was taken into account in the 6 contribution calculation®%. The following values were used for the age calculation: 1,505+376,m, 
351+488,m and 45+ 12\m for the initial thickness, thickness after preparation on the dentine side and on the sediment side, respectively. 

3The cosmic dose was estimated based on the depth using Prescott and Hutton’s formulae*°. Because we have at present no means to know precisely when the erosion took place and since when the 
archaeological material became buried under less than 7 m of sediment, a depth of 2.75 m was used for the cosmic dose rate estimation as an intermediate value between the 7 m of sediments that 
once covered the archaeological layer and the present 70cm to 1.20 m depth at which the archaeological material was recovered, and as an average between the once full thickness of the archaeolog- 


ical layer and the present thickness from which the 


archaeological material was recovered. A cosmic dose estimated from a depth of 7m would result in a 10% older age for unit F and a cosmic dose 


estimated from a depth of 1m would result in a 7% younger age for unit F. 
4Uncertainties on the ESR/U-series ages were calculated using Monte Carlo approach*!. 
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Lethal trap created by adaptive evolutionary 
response to an exotic resource 


Michael C. Singer!?* & Camille Parmesan! ** 


Global transport of organisms by humans provides novel 
resources to wild species, which often respond maladaptively. 
Native herbivorous insects have been killed feeding on toxic exotic 
plants, which acted as ‘ecological traps’!~*. We document a novel 
‘eco-evolutionary trap’ stemming from the opposite effect; that 
is, high fitness on an exotic resource despite lack of adaptation to 
it. Plantago lanceolata was introduced to western North America 
by cattle-ranching. Feeding on this exotic plant released a large, 
isolated population of the native butterfly Euphydryas editha 
from a longstanding trade-off between maternal fecundity 
and offspring mortality. Because of this release—and despite a 
reduced insect developmental rate when feeding on this exotic— 
Plantago immediately supported higher larval survival than did 
the insects’ traditional host, Collinsia parviflora®. Previous work 
from the 1980s documented an evolving preference for Plantago 
by ovipositing adults®. We predicted that if this trend continued 
the insects could endanger themselves, because the availability of 
Plantago to butterflies is controlled by humans, who change land 
management practices faster than butterflies evolve®. Here we 
report the fulfilment of this prediction. The butterflies abandoned 
Collinsia and evolved total dependence on Plantago. The trap 
was set. In 2005, humans withdrew their cattle, springing the 
trap. Grasses grew around the Plantago, cooling the thermophilic 
insects, which then went extinct. This local extinction could 
have been prevented if the population had retained partial use of 
Collinsia, which occupied drier microhabitats unaffected by cattle 
removal. The flush of grasses abated quickly, rendering the meadow 
once again suitable for Euphydryas feeding on either host, but no 
butterflies were observed from 2008 to 2012. In 2013-2014, the site 
was naturally recolonized by Euphydryas feeding exclusively on 
Collinsia, returning the system to its starting point and setting the 
stage for a repeat of the anthropogenic evolutionary cycle. 

In his 1999 address to the Ecological Society of America, Gary Polis 
described Homo sapiens as a ‘ubiquitous keystone pest, and so we have 
proven to be’. Although adaptation to human activities enables some 
wild species to coexist with us®, many fail to adapt to human land man- 
agement and suffer in consequence. An example of this is the setting 
by humans of ecological traps”'°, which are defined as follows: “in an 
environment altered suddenly by human activities, an organism makes 
maladaptive habitat choices based on formerly reliable environmental 
cues, despite availability of higher quality habitat”!”. 

Most ecological traps result from preference by wild organisms for 
novel resources that are unsuitable or toxic. Australian monitor lizards 
suffered population crashes after feeding on toxic exotic cane toads!!. 
Bees have maladaptively preferred crops grown from seed treated with 
neonicotinoid insecticides’”. Several examples have involved insect her- 
bivores feeding on exotic host plants that reduced insect fitness!~* but 
fell short of causing population extinctions, because traditional hosts 
were still used alongside the exotics. 

In the ecological trap scenario, the novel resource is accepted as food 
but is initially detrimental!®, with the expectation that evolution should 


lead either to behavioural avoidance of the resource or physiological 
ability to use it?. Here we describe a different type of trap formed by 
a novel resource that immediately supported such high fitness that a 
butterfly population evolved complete dependence on it, causing local 
extinction when humans withdrew our apparent gift. This paradoxical 
phenomenon is previously undescribed and does not fit published 
definitions of either ecological trap or evolutionary trap’'°. We here 
describe it as an eco-evolutionary trap. 

Our study insects—the thermophilic!’ Edith’s checkerspot butter- 
flies, E. editha—formed a sedentary, isolated population around the 
margins of a spring-fed meadow (Schneider’s Meadow; hereafter 
‘Schneider’) at 1,800 m elevation in Carson City, Nevada. This species 
has one generation per year. The novel host of this population was 
the exotic perennial, P. lanceolata, and their traditional host was the 
ephemeral native annual, C. parviflora (Extended Data Fig. 1). 

The geographically closest E. editha populations of the same ecotype 
as Schneider, but where Plantago had not arrived, used Collinsia as their 
sole host®. When neonate larvae from one of these ‘ancestral’ popula- 
tions were transplanted to Schneider, their survival on Plantago was 
identical to that of the local Schneider insects’. The ancestral popula- 
tions were ready to use Plantago from the moment of its introduction. It 
is not surprising, then, that this exotic has been colonized twice by other 
North American Euphydryas'*!°. Oregon E. editha taylori are now 
dependent on the exotic, although it is not clear if this is due to evolu- 
tion of the butterfly, because the original host(s) have disappeared’>. 

During the 1980s, survival of E. editha at Schneider was consist- 
ently higher on the exotic than on the traditional host (Table 1), despite 
larval growth being about 18% slower on Plantago°. In situations in 
which the principal host is ephemeral (as is Collinsia), female E. editha 
face a trade-off between maternal fecundity and offspring survival. 
Prolonging larval development can increase fecundity but the resulting 
delay in adult emergence augments the risk of offspring mortality from 
host senescence’®, The evolutionary response to this trade-off has been 
to delay emergence to the point that many offspring routinely starve 
from phenological asynchrony with their hosts’®. The stage is set for the 
time constraint to be released and fitness increased by host-switching 
to the longer-lived Plantago, despite slower larval growth on it. Indeed, 
the majority of larval mortality observed on Collinsia was from host 
senescence, whereas Plantago did not senesce during the seasons of 
larval activity. 

No adult females sampled from ancestral populations preferred 
Plantago over Collinsia for oviposition, but around 20% accepted both 
hosts equally°. By contrast, by 1982 Plantago was already preferred for 
oviposition by a minority of preference-tested adults at Schneider®. 
Evolution of Plantago preference had begun. 

Given natural selection for oviposition on Plantago, and given that 
oviposition preferences at Schneider were both heritable (estimated 
heritability 0.9) and correlated with offspring performance!’, we 
expected to see rapid evolution of preference, which did indeed occur: 
the proportion of insects preferring Plantago for oviposition increased 
from about 7% to around 50% by 1990°. This change was heritable. 


1Biological and Marine Sciences, University of Plymouth, Plymouth, UK. 2Theoretical and Experimental Ecology Station, CNRS, Université Paul Sabatier, Moulis, France. ?Department of Geological 
Sciences, University of Texas at Austin, Austin, TX, USA. “e-mail: michael.singer@plymouth.ac.uk; camille.parmesan@plymouth.ac.uk 
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Table 1 | Survival of E. editha placed in the field on Collinsia and Plantago at Schneider’s Meadow in the 1980s 


Group survival Group survival 


Individual survival Individual survival Statistical significance 


Year and reference Life stages measured _Collinsia Plantago Collinsia Plantago (group survival) 

1980 (previously  Oviposition to 33% (n=15) 70% (n=20) Not recorded Not recorded P=0.044 

unpublished) second instar 

198229 Oviposition to 17% (n=58) 55% (n=86) 4% (n=1,810) 24% (n=2,764) P=0.000011 
second instar 

1985517 Larval survival 62% (n=63) 84% (n=62) 19.9% (n=1,260) 27.1% (n=1,240) P=0.0085 (combining data 
for ten days from from both references) 
hatching 

1986 (previously Oviposition to 29% (n=28) 80% (n=25) Not recorded Not recorded P=0.00028 


unpublished) second instar 


Proportions of groups surviving analysed as 2 x 2 contingency tables by two-tailed Fisher's exact test. 


Laboratory-raised, Collinsia-fed offspring of butterflies caught in the 
field in 1990 were significantly more Plantago-preferring than were 
similarly raised offspring of insects caught in the field in 1983°. 

Here we report that this bout of anthropogenic evolution continued 
until monophagy on Plantago was achieved. In 2005 and 2007, all 
tested females preferred to oviposit on the exotic (Fig. 1a) and in 2007 
all larvae found in the field were on Plantago (Fig. 1b). The insects 
had abandoned both their traditional host (Collinsia) and the minor 
host Penstemon rydbergii, which had been incorporated into their diet 
during the host shift (Extended Data Table 1 and Extended Data Fig. 2). 

In 1993, we wrote that this episode of anthropogenic evolution was 
“foreshadowing a new problem in conservation biology. By adapting 
genetically to human-induced changes, the insects risk becoming 
dependent on continuation of the same practices. This is a serious risk, 
because human cultural evolution can be even faster than the rapid 
genetic adaptation that the insects can evidently achieve”®. 

This prediction was fulfilled. In late 2005, following the death of 
Harry Schneider, the meadow was sold and cattle-grazing ceased. 
Grasses then grew freely, and by March 2007 96% of Plantago plants 
had become embedded in grass (Table 2 and Extended Data Figs. 3, 4a); 
E. editha larvae wandered among dense vegetation, no longer able to 
bask in sunlight on bare ground adjacent to their hosts. 

The conspicuous communal webs spun by gregarious young larvae 
render E. editha easy to census (Extended Data Fig. 4b) and population 
extinction feasible to establish. Searches for larval webs, eggs and adults 
found no E. editha in 2008, 2009, 2010 or 2012 (Fig. 1c and Extended 
Data Table 1): the Schneider population was extinct. 

By analogy with known cause-effect relationships involving other 
thermophilic butterflies, we attribute this extinction to the flush of lush 
vegetation caused by cattle removal. Lushness is associated with high 
rates of predation on butterfly larvae'® and increased lushness caused 
by the abandonment of traditional land management in Europe has 
caused ground-level cooling, which has resulted in butterfly population 
declines and local extinctions!®?!. The extinction of the large blue 
butterfly (Phengaris arion) in the UK has been attributed to micro- 
climatic cooling caused by increased lushness, which itself followed a 
reduction of grazing by rabbits after myxomatosis”’. The restoration of 
grazing and re-warming of ground-level microclimate were essential 
for the successful re-introduction of this butterfly”!. 

Particularly in sparse vegetation, sunshine creates thermal stratifi- 
cation with microclimates that are hotter close to the ground”’, which 
speeds insect development”’. We used previous observations of natural 
oviposition sites at Schneider to measure ‘eggspace’ temperatures. 
Eggspaces on exposed Plantago plants were augmented by 13.4°C 
above ambient temperature, compared to 6.0°C on embedded Plantago 
plants; if placed on embedded plants, eggs would be on average >7 °C 
cooler than if placed on exposed plants (Extended Data Table 2). In 
light of the high proportion of plants embedded in 2007 (Table 2), and 
of the known adverse effects of host embedding!*''® and microclimatic 
cooling'-*! on other butterflies, it is not surprising that this cooling 
was followed by extinction. 

As anthropogenic nutrients were used up, the flush of grasses abated 
naturally in 2008; since this time, Plantago plants exposed to full 


Previously published data New data 


Preference for 
Plantago 


15 


Percentage of adult 
females with preference 


Preference for 
Collinsia 


sv 


Percentage of 
groups on Plantago 


Number of groups per 
10,000 square metres 


Year 


Fig. 1 | Changes in E. editha oviposition preference, larval diet and 
population density. a, Changes in proportions of adults preferring to 
oviposit on Plantago and Collinsia. The two graphs are not mirror images 
because insects without preference are not shown. Changes in strength of 
preference are shown in Extended Data Fig. 7. b, Changes in larval diet. 
The absence of larvae on Collinsia in 1988-1989 was caused by temporary 
spatial restriction during and after a bottleneck; in these two years, larvae 
were restricted to Plantago and Penstemon (Extended Data Table 1 and 
Extended Data Fig. 2). c, Changes in larval density. For all three panels 
numbers within graphs represent biologically independent sample sizes; 
individual adult insects (a) or larval groups (b, c). Error bars = 95% 
confidence intervals calculated according to Newcombe”, with continuity 
correction; source data are given in Extended Data Table 3. No error bars 
are given for larval diet in years of inadequate sampling (1982, 1983 and 
2015) or total census (1988,1989 and 2014). 
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Table 2 | Estimated host densities and percentages of each host 
embedded and exposed 


Plantago Collinsia Percentage of 
density (plants density (plants Percentage of Collinsia 

Year per m?) per m2) Plantago embedded = embedded 

1984 2.3 (50in 103 (2,266 in <10% <5% 
244 30-cm? 244 30-cm? 
quadrats) quadrats) 

2005 9.4(317in 27.5 (330 <10% <5% 
152.25-m2 in 12 1-m2 
quadrats) quadrats) 

2007 0.21(18 35.2 (528 96.4% (189/196) 0.5% (69/657) 
in85 1-m? in 15 1-m? 
quadrats) quadrats) 

2008 2.8(44in 26.1 (418 18.6% (43/231) 3.8% (77/559) 
16 1-m? in 16 1-m? 
quadrats) quadrats) 

2009 2.5(48in 45.7 (869 11.8% (251/2,115) 11.7% (114/975) 
19 1-m? in 19 1-m? 
quadrats) quadrats) 

2014 3.4(173 52.4 (472 8.5% (28/328) 8.0% (150/833) 
in501-m2 —in9.1-m2 
quadrats) quadrats) 


sunlight have again been available (Table 2 and Extended Data Fig. 5). 
In 2013-2014, butterflies recolonized: an exhaustive search in 2014 
revealed nine E. editha larval webs, all of which were on the ancestral 
host Collinsia (Fig. 1b, c and Extended Data Fig. 4b). Recolonization 
had occurred from a population that resembled the starting condition, 
before the anthropogenic evolution. 

The distance from Schneider’s Meadow to the nearest known popu- 
lation of the subspecies E. editha monoensis (at Simee Dimeh summit) 
is 37.7km. How far is this to a Euphydryas? Previous** measurements 
of the colonization of empty habitat patches by the bay checkerspot 
(E. editha bayensis) found that the greatest cumulative distance travelled 
in ten years was 4.5 km. Given the sedentary nature of the butterfly and 
the physical isolation of the meadow, we did not consider the possibility 
of rapid recolonization. Our mindset in 2014 was simply to reconfirm 
the extinction, and we were very surprised to find larvae. We later 
discovered that the ‘Carter Springs’ fire in September 2012 had posi- 
tively affected the butterflies of Simee Dimeh by extending the size and 
lifespan of Collinsia (Extended Data Fig. 6), and causing a population 
boom of E. editha similar to a previously documented response to fire”. 
This provides a plausible source for the recolonizing Collinsia-feeding 
butterflies, despite the distance they would have had to travel. 

Could the original population have survived if it had retained its tra- 
ditional diet of Collinsia alongside Plantago, as it did from 1982-1987 
and 1990-2002? Collinsia was both most abundant and most used 
by the insects in dry sagebrush around the meadow edge (Extended 
Data Figs. 1-3), where removal of cattle did not result in embedding of 
Collinsia plants, even at peak lushness in 2007 (Table 2 and Extended 
Data Fig. 3). If the butterflies had adopted the exotic less completely, 
they would probably have survived the change in land use. Conversely, 
if they had remained monophagous on Collinsia they may not have sur- 
vived the bottleneck in 1988-1989°, when—after record-breaking frost 
without insulating snow (—25°C at Minden on 1 January 1988)—the 
population was spatially restricted to a small, sheltered, south-facing 
area from which Collinsia was coincidentally absent (Fig. 1b, c; see 
discussion of this event in Extended Data Fig. 2). 

The evolution of E. editha at Schneider illustrates the process by 
which, over thousands of years, European grassland butterflies evolved 
widespread dependence on human haymaking and grazing, render- 
ing them vulnerable to the abandonment of traditional management 
techniques’®*'. However, E. editha as a species is not threatened by 
the eco-evolutionary trap that we document. The ecotypic variation 
and rapid evolution of this species augur well for its resilience to 
environmental fluctuations, whether natural or anthropogenic”. By 
contrast, substantial perturbation occurred at the subspecies level, as 
E. e. monoensis is currently restricted to four known sites: two isolated 
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populations and two metapopulations distributed along 235 km of the 
eastern Sierra Nevada from McGee Creek at latitude 37° 18’ 24” N to 
Schneider at latitude 39° 6’ 33’’N. 

Unless the Schneider population sent out successful propagules, 
which we judge to be unlikely, the lineage that we observed from 
1982 to 2007 is extinct. At the population level, the changes that we 
observed exemplify marked, oscillating anthropogenic evolution of a 
species not directly targeted by humans. This example of small-scale 
oscillating diet evolution oddly mimics the repeated recolonizations 
of abandoned hosts detected across millions of years by phylogenetic 
analysis of the butterfly family Nymphalidae, to which E. editha 
belongs”®. 

Attempts at the ‘evolutionary rescue’ of wild species” may be 
compromised when anthropogenic traps such as the one documented 
here remain undetected. These traps may be cryptic to humans, but 
understanding them may become increasingly important to species 
conservation in the Anthropocene. 
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METHODS 


Statistics. No statistical methods were used to predetermine sample size. All tests 
are two-tailed. Error bars on figures—given only where justified by sampling tech- 
niques—are 95% confidence interval (c.i.), calculated using http://vassarstats.net/ 
and the methodology of Newcombe” for c.i. of proportions, with continuity cor- 
rection. Figure la does not explicitly depict insects without preference but they 
can be deduced by subtracting the depicted percentages from 100%. Otherwise 
no relevant data have been omitted from any experiment or set of observations. 
Sample sizes were largely limited by feasibility. 

Blind preference testing. Oviposition preference tests to estimate heritability!” 
were performed blind, in the sense that the tester did not know which insects were 
siblings or offspring of particular parents. The high heritability estimate from this 
blind testing (0.9) gave confidence that the tests were not subject to severe observer 
bias; repeated blind testing of the same butterfly by different observers has also 
given confidence. However, in the present analyses the appropriate blind test would 
require the tester to not know which year it was, and we were not able to achieve 
this without compromising the quality of the data. 

Oviposition preferences. To generate the data shown in Fig. 1a, butterflies were 
captured in the field and their oviposition preferences tested by a standardized 
technique, in which encounters are staged between the tested insect and each plant 
in alternation. Plants were undisturbed in their natural habitats or freshly trans- 
planted into pots in their own soil. Acceptance of plant taste was judged from full 
abdominal curling and extrusion of the ovipositor for 3 s*!. Acceptance and rejec- 
tion were recorded at each encounter, but oviposition was not allowed*!. Videos 
showing acceptance in such staged encounters have previously been published”. 
During each test the range of plants that would be accepted, if encountered, 
expands over time with increasing motivation to oviposit. Therefore, acceptance 
of plant A followed by rejection of plant B is recorded as preference for A over B. 
Testing of assumptions underlying this technique have previously been described?!. 
Because insects without preference are not shown in Fig. 1a, percentages do not 
sum to 100%, except for 2005 and 2007, when preference for Plantago was unani- 
mous among tested butterflies. Raw data are shown in Extended Data Table 3. A 
more detailed comparison between early and late periods, showing strength as 
well as direction of preferences, is given in Extended Data Fig. 7. The assumption 
that the oviposition preferences of these insects are not influenced by prior expe- 
rience, either as larvae or as adults, is supported by previous observations and 
experiments!”!32, 

Percentages of egg clutches and/or larval groups found on Plantago. Raw data 
for Fig. 1b are shown in Extended Data Tables 1, 3. Low sample sizes in 1988 and 
1989 were total counts, reflecting a population bottleneck. The high proportion 
of larvae on Plantago in 1989 reflects a lag in recolonization of Collinsia after the 
spatial restriction in the bottleneck, shown in Extended Data Fig. 2. In 2002-2007 
we searched a larger area for larvae on Collinsia than for larvae on Plantago. For 
these years, Fig. lb—showing the proportion of larvae found on the two hosts— 
overestimates the overall proportion on Collinsia; the areas searched are given in 
Extended Data Table 1. To be conservative, we indicated data for 2002 and 2005 as 
‘published’ although neither sample sizes nor confidence limits had previously been 
given**. The graph shows that in 2015 we found a single group of hatching eggs 
on Collinsia. However, we performed no census in 2015 as our visit was too early. 
Estimates of density of larval webs. Figure 1c gives estimates of the density of 
larval webs on all hosts combined per 10,000 m/°. Raw data are given in Extended 
Data Table 1. In each census, individual host plants of all species were searched for 
eggs, larvae, larval webs and typical damage. Very different scales of census were 
conducted in different years. For small patches, every individual host plant could 
be searched; for larger patches, stratified line transects were used. Confidence 
limits are not given, but the logarithmic scale of the y axis makes clear the scale of 
population changes; for example, the raw data (Extended Data Table 1) include 
the finding of 34 groups in only 70 m? in 1982 and 4 groups in 20,000 m? in 1988. 
No density estimate is given for 2007 because, although 38 groups were found 
(Fig. 1b), at the time of the last census some were still eggs, and so were harder 
to find than larvae. Sample calculation for 2002 (see Methods for Extended Data 
Table 1): estimated number of webs on Plantago = 67 x 4,000/170 = 1,576. Total 
number on Collinsia=3 (all were counted). Estimated total number webs in entire 
area of 20,000 m= 1,579; density per 10,000 m?=789. 

Survival of E. editha larvae on Collinsia and Plantago. Effects of oviposition host 
on fitness are given in Table 1. Eggs were placed out on randomly chosen hosts 
in the field, by manipulating butterflies to lay (see previously published videos”). 
Gregarious neonate larvae were placed with a sable brush in groups of 20-35. 


After ten days of larval life (just before diapause), each group was gathered. It is in 
pre-diapause life that the principal effects of oviposition on fitness are manifest, as 
post-diapause larvae are mobile and can switch between host species. 

Estimated host densities and percentages of embedded and exposed hosts. 
Host densities were estimated from quadrats along line transects placed within the 
strata of the ecotone in which each plant was concentrated (Extended Data Fig. 1). 
Early approximate estimates of the percentage of embedded plants were derived 
from photographs, memory and anecdotal observation, because embeddedness 
was previously uncommon and its value as a trait was not anticipated before the 
sudden embedding of Plantago in 2007. A plant was classed as embedded if it 
was surrounded for >50% of its circumference by vegetation taller than itself. In 
most cases the differences between embedded and exposed plants were striking 
(Extended Data Fig. 3); in the March census in 2007, some embedded plants were 
hidden and not found—they were pressed under thatch from winter snow and 
inaccessible to post-diapause larvae feeding in March—but reappeared and, if they 
were no longer completely embedded, were accessible to ovipositing butterflies in 
May. Extended Data Fig. 4a has a photo of eggs naturally laid on such a reappearing 
Plantago, emerging from winter thatch, in May, 2007. 

Numbers of groups of E. editha on Plantago, Collinsia and Penstemon. Extended 
Data Table 1 contains counts over wider areas and including lower host densities 
than the ‘core’ areas censused in Table 2. The total area within which suitable 
Plantago might be found was around 4,000 m’; the total area that might contain 
suitable Collinsia was larger, maximally about 17,000 m? (1982 map in Extended 
Data Fig. 2) but less in dry years. About 1,000 m? overlapped between the two 
distributions. Wider areas were searched in 1988 and 1989 to check whether we had 
missed part of the population in previous work. We had not, so as we found no habitat 
in this wider search, subsequent searches were restricted to meadow margins 
and adjacent sagebrush, approximately 20,000 m? (Extended Data Figs. 1, 2). 

In 2002, 2005 and 2007, the entire area in which larvae might have been found 
on Collinsia was searched; Plantago areas were not searched in their entirety, but 
merely sufficiently to get an estimate of plant density and occupancy by the but- 
terflies. After the extinction in 2007-2008, the entire habitat was searched in each 
census. In most years, more plants were searched for E. editha than were included 
in censuses to estimate plant density reported in Table 2. 

Extended Data Table 1 shows data from areas censused in which plants with 
and without larvae were counted. Where maps in Extended Data Fig. 2 show more 
insects than in Extended Data Table 1, as in 1982, the insect distribution in the 
map is derived from rapid assessments in which insects were observed and counted 
but plants were not. 

Temperatures of eggspaces on Collinsia and Plantago. Eggspace tempera- 
tures given in Extended Data Table 2 were measured with a fine thermocouple 
(MT-29/1B insect probe, type T, copper-constantan). After measuring each 
exposed Plantago we measured one or two adjacent (within 1 m) embedded plants 
growing in the same position within the ecotone; we took care that embedded 
plants were not systematically growing in more humid microsites. When two 
embedded plants were measured, we used the mean value of the two for analysis. 

We found three errors in our previous publications: (1) number of groups found 
on Collinsia in 1990 is here corrected to 18 from 6°; (2) number of butterflies 
preference-tested in 1986 is corrected to 36 from 31>; (3) a misleading early refer- 
ence to the population as feeding on Plantago in 1969** stemmed from initial visit 
in June, after Collinsia had senesced and disappeared. Faster-growing larvae on 
Collinsia had entered diapause and were not found, although some still remained 
feeding on Plantago. The next visit in 1971 made it clear that although Plantago was 
already being used, Collinsia was the principal host. No censuses were performed 
in 1969-1971. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. Raw data are included in Fig. 1, Tables 1, 2, Extended Data 
Figs. 1-7 and Extended Data Tables 1-3. All other data are available from the 
corresponding authors upon reasonable request. 


31. Singer, M. C., Vasco, D. A., Parmesan, C., Thomas, C. D. & Ng, D. Distinguishing 
between preference and motivation in food choice: an example from insect 
oviposition. Anim. Behav. 44, 463-471 (1992). 

32. Singer, M. C., Wee, B., Hawkins, S. & Butcher, M. in The Evolutionary Ecology of 
Herbivorous Insects: Speciation, Specialization and Radiation (ed. Tilmon, K. J.) 
311-324 (Univ. of California Press, Berkeley, 2008). 

33. Singer, M. C. Evolution of food-plant preference in the butterfly Euphydryas 
editha. Evolution 25, 383-389 (1971). 
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Extended Data Fig. 1 | Habitat and host distributions at Schneider’s 
Meadow. a, The butterfly habitat is a single, isolated, spring-fed wet 
meadow in the centre of the photograph, surrounded by non-habitat for 
the butterflies: dry sagebrush scrub and coniferous forest. b, Distribution 
in the meadow-edge ecotone of the principal hosts, Plantago and Collinsia, 
plus the minor host Penstemon. c, Typical difference in phenology between 
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Plantago and Collinsia in May 2014. In the foreground are red, senescent 
Collinsia plants that are edible to the insects but which will die within a 
few days; behind them is a single green, budding Plantago that will remain 
edible until after all E. editha larvae have entered diapause. d, Hatching egg 
clutch on Collinsia cotyledon in hot, dry microhabitat. 
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Eggs/larval groups on Plantago 


Extended Data Fig. 2 | Changes in distribution of early stages of 

E. editha (eggs or larvae) from 1982 to 2007. Data were added by hand to 
the GoogleMaps image. Most stars represent several groups. For example, 
in 1989, 23 groups were found on Plantago and one on Penstemon. The 
restricted distribution in that year followed a bottleneck in 1988 after 
record-breaking cold in January, without the usual insulating snow 

cover. Schneider’s Meadow is at 1,700 m elevation: nearby towns at lower 
elevations recorded —25°C on 1 January 1988 (Minden, 1,444 m elevation) 
and —20°C on 18 January (Carson City, 1,424 m). Note the recolonization 


Eggs/larval groups on Plantago 


of Collinsia as the insects expanded back into the distribution of Collinsia 
in 1990 and 1993. Larval groups recorded in 1988 and 1989 were clustered 
around an attractive nectar source (Wyethia sp.); it is possible that adults 
attracted to this nectar in 1988 had survived as larvae on Collinsia in 
1987-1988 and then, as adults, laid eggs in 1988 on Plantago adjacent to 
nectar. This possibility prevents us from making a definite conclusion that 
the population would have become extinct if eggs laid in 1987—before the 
bottleneck—had been placed only on Collinsia. Data for 2005 exist and 
closely resemble those for 2002. 
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Extended Data Fig. 3 | Effects of cessation of grazing: Plantago plants 
embedded, whereas Collinsia plants are unaffected. Data are provided 
in Table 2. a, Plantago at Schneider in 1984, exposed to full sunlight and 
physically acceptable to ovipositing E. editha. b, Meadow edge in May 
2007, after cattle removal. In the foreground is Plantago habitat with thick 
grasses; in the background is Collinsia habitat not grassed-in, with barren 


Grass 


—> Plantago 


ves 


spaces between the sagebrush. c, Collinsia in May 2007, unaffected by 
the embedding that simultaneously affected the Plantago plants shown 
in d and e. Embedding in grasses not only cooled the Plantago plants 
(Extended Data Table 2) but also rendered them hard to find, both by 
butterflies seeking oviposition sites and by larvae seeking food. 
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Extended Data Fig. 4 | Egg clutch and communal web. a, Natural after recolonization. Second-instar larvae on Collinsia at Schneider in 
egg clutch laid in May 2007 on Plantago. The plant is pushing through May 2014. This is a single group of larvae, probably stemming from a 
winter thatch, and would have been unlikely to be acceptable to single oviposition event; there were nine such groups, all on Collinsia. 
ovipositing butterflies before cattle removal, when plants similar to the Unexpectedly, this group is not on the most exposed Collinsia available. 


one in Extended Data Fig. 3a were available. b, Communal web spun 
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Extended Data Fig. 5 | The return of mostly exposed Plantago plants after anthropogenic lushness abated. Photographs were taken in 2014, but 
Table 2 shows that they could have been taken in 2008 or subsequently. 
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o* as : ee R 
Extended Data Fig. 6 | Effect of fire on size and longevity of C. larvae at its base from a naturally laid egg clutch. The fifteen senescent 
parviflora. A single fire-enhanced Collinsia at McGee Creek (east of individual Collinsia lying on the ground represent a haphazard sample 
Bishop, California) is still blooming. There is a small web of E. editha gathered from unburned microsites within 2 m of the enhanced individual. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Green: No 


Blue: preference 
Collinsia \ boeioial Year 1983 


preference - Plantago ae 


prefe rence (data from 
30 Figure 1 of 
reference 18) 
20 4 


Years 2005 
+2007 
n=21 


Percentage of insects 
OW 
© 


20 (data not 
previously 
1 0 published) 
Discrimination phase in hours 

Extended Data Fig. 7 | Strength and direction of the oviposition would be accepted (whichever was next encountered). Insects in the 
preferences of butterflies sampled at Schneider in 1983 and blue 1-4 column would search for 1-4h, during which only Collinsia 
2005 + 2007. The number over each bar is the sample size of biologically would be accepted. If they failed to find Collinsia within 4h, they would 
independent samples: individual butterflies captured in the field. The subsequently accept either host, until actual oviposition occurred. Green 
discrimination phase is the length of time for which the insect would central bar shows butterflies without preference. Sample size for 2005- 
search, during which it would consistently accept the preferred host and 2007 is smaller than in Fig. 1a because we include on Fig. 1a (and here 
consistently reject the second-ranked host. At the end of this phase, if it omit) five butterflies for which we determined the direction of preference, 


does not succeed in ovipositing, the insect enters an acceptance phase but not the strength. 
after reaching the level of oviposition motivation at which either host 
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Extended Data Table 1 | Census results: areas searched and numbers of egg clutches or larval webs found on each host 


Year (date) Approximate area Groups on Plantago Groups on Collinsia Groups on Penstemon 
searched (m’) (approximate number of (approximate number of (approximate number of 
plants searched) plants searched) plants searched) 
1982 (19-21 May) 70 4 (67) 28 (441) 2 (233) 
1983 (2 June) 20 3 (89) 11 (506) 0 (112) 
1984 (17-31 May) 1,250 4 (79) 27 (1169) 0 (94) 
1986 (26 April-17 May) Not recorded 62 (not counted) 50 (not counted) 10 (not counted) 
1987 (7-8 May) 2,030 8 (260) 10 (892) 0 (190) 
1988 20,000 3 (434) 0 (>3,000) 1 (87) 
1989 50,000 23 (462) 0 (5,500) 1 (504) 
1990 (13 May) 2,000 25 (135) 18 (1789) 2 (365) 
1993 (1-3 June) 5,800 22 (1104) 9 (1390) 1 (116) 
New data below here 
2002 170 for Plantago 64 (770) 3 (>5000) 1 (64) 
16,000 for Collinsia 
2005 (6-12 June) 65 for Plantago 67 (317) 1 (>3000) 0 (209) 
16,000 for Collinsia 
2007 800 for Plantago, 38 (211) 0 (>2000) 0 (30) 
(17-18 March, 5 May) 16,000 for Collinsia 
2008 4,000 for Plantago, 0 (2115) 0 (>4,000) 0 (88) 
16,000 for Collinsia 
2009 4,000 for Plantago, 0 (1840) 0 (>3,000) 0 (31) 
16,000 for Collinsia 
2014 (11-12 May) 4,000 for Plantago, 0 (1125) 9 (>3,000) 0 (46) 
16,000 for Collinsia 
2015 (28-31 May) n/a 0 (n/a) 1 (n/a) 0 (n/a) 


Data for 1982-1993 have previously been published®. These are not the same data shown in Extended Data Fig. 2 (see Methods). Extension of survey area 
additional habitat, so density in Fig. 1c for tha 


year is calculated using the estimated maximum habitat area of 20,000 m2, giving a higher estimate than t 
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to 50,000 m? in 1989 did not reveal 
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Extended Data Table 2 | Measurements of temperature excess over ambient air at 1-m height at three potential types of oviposition site at 
Schneider’s Meadow: embedded Plantago, exposed Plantago and exposed Collinsia 


native Collinsia 


exotic Plantago 


exposed 15.6, 10.8, 9.5, 14.8, 8.8, 8.4, 
10.2, 12.1, 10.4, 11.6,11.1,11.0,12.2,9.4, 
: , 11.8,17.4,11.2,19.3,16.5, 
13.9,19.0,10.6 
N=12 
N= 18 
embedded 


7.9,5.2,7.2, 8.3, 4.1, 9.4, 
5.1, 9.7, 5.3, 1.2, 2.5, 6.0, 
3.1, 5.4, 7.9, 9.2, 3.3, 7.5 


N=18 


Measurements were taken at eggspace height (2-4.cm) and at 1 m above ground between 12:00 and 15:00 in May 2015. Each measurement of exposed Collinsia or Plantago came from a different, 
haphazardly chosen habitat patch. On occasion we measured two embedded Plantago plants in the same patch; when this occurred we show the mean of the two values and used these means in 


analysis, treating the two plants as a single sample. Consequently, each data point represents a biologically independent sample, and the independent sample sizes are 12, 18 and 18 as indicated at 


the base of each cell. 


Comparison between exposed Plantago (mean excess temperature above ambient = 13.4°C) and embedded Plantago (mean excess temperature above ambient = 6.0 °C) by two-sided t-test: f= 7.55, 


degrees of freedom = 28, P< 0.0001. 


Comparison between exposed Collinsia (mean excess temperature above ambient = 11.46 °C) and exposed Plantago (13.4°C) by two-sided t-test: t= 1.77, degrees of freedom = 28, P=0.09. 
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Extended Data Table 3 | Source data for means and confidence limits of adult preference and larval diet 


Year Sample size Number preferring Proportion Lower 95% c.i. Upper 95% c.i. 
adult females Plantago preferring Plantago 

1982 44 3 0.068 0.0178 0.197 

1983 34 8 0.235 0.114 0.416 

1985 59 12 0.203 0.114 0.332 

1986 36 11 0.306 0.169 0.483 

1990 27 14 0.519 0.324 0.708 

2005 11 11 1.00 0.678 1.00 

2007 15 15 1.00 0.747 1.00 

Year Sample size Number preferring Proportion Lower 95% c.i. Upper 95% c.i. 
adult females Collinsia preferring 

Collinsia 

1982 44 27 0.614 0.455 0.752 

1983 34 16 0.471 0.301 0.646 

1985 59 25 0.423 0.299 0.559 

1986 36 16 0.444 0.283 0.617 

1990 27 5 0.185 0.070 0.388 

2005 11 0 0 0 0.321 

2007 15 0 0 0 0.254 

Year Sample size Number of larval Proportion of Lower 95% c.i. Upper 95% c.i. 
larval groups groups on Plantago | groups on Plantago 

1984 31 4 0.129 0.042 0.308 

1986 122 62 0.508 0.417 0.599 

1987 18 8 0.444 0.224 0.687 

1990 45 25 0.556 0.401 0.700 

1993 32 22 0.688 0.499 0.833 

2002 69 64 0.928 0.832 0.973 

2005 68 67 0.985 0.910 0.999 

2014 9 0 0 n/a (complete 

census) 


Source data for Fig. 1a, b. Statistical test described in the legend of Fig. 1. 
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Subepithelial telocytes are an important source of 
Wnts that supports intestinal crypts 


Michal Shoshkes-Carmel!, Yue J. Wang!, Kirk J. Wangensteen!, Beata Toth?, Ayano Kondo!, Efi E. Massassa?, Shalev Itzkovitz? & 


Klaus H. Kaestner!* 


Tissues that undergo rapid cellular turnover, such as the mammalian 
haematopoietic system or the intestinal epithelium, are dependent 
on stem and progenitor cells that proliferate to provide differentiated 
cells to maintain organismal health. Stem and progenitor cells, in 
turn, are thought to rely on signals and growth factors provided by 
local niche cells to support their function and self-renewal. Several 
cell types have been hypothesized to provide the signals required 
for the proliferation and differentiation of the intestinal stem cells 
in intestinal crypts!-®. Here we identify subepithelial telocytes as an 
important source of Wnt proteins, without which intestinal stem 
cells cannot proliferate and support epithelial renewal. Telocytes 
are large but rare mesenchymal cells that are marked by expression 
of FOXL1 and form a subepithelial plexus that extends from the 
stomach to the colon. While supporting the entire epithelium, 
FOXLI‘ telocytes compartmentalize the production of Wnt 
ligands and inhibitors to enable localized pathway activation. 
Conditional genetic ablation of porcupine (Porcn), which is required 
for functional maturation of all Wnt proteins, in mouse FOXL1* 
telocytes causes rapid cessation of Wnt signalling to intestinal crypts, 
followed by loss of proliferation of stem and transit amplifying cells 
and impaired epithelial renewal. Thus, FOXL1* telocytes are an 
important source of niche signals to intestinal stem cells. 

After the cloning of Foxl1, which encodes the gene for the winged- 
helix transcription factor FOXL1 (previously known as Fkh6’), initial 
expression profiling focused on measuring its steady-state mRNA 
levels, both by RNase protection assays and by in situ hybridization®”. 
Foxl1 transcripts were found at low levels in the adult stomach and 
intestine, and in about one to two cell layers of the mesodermal tissue 
surrounding the primitive fetal gut tube. In order to visualize subep- 
ithelial Foxl1-expressing cells in their entirety, we used the Foxll-cre 
transgenic line!” in conjunction with genetic lineage labelling using 
the Rosa26-mTmG allele!!. In this model, Cre-labelled cells produce 
a membrane-targeted version of green fluorescent protein (GFP), 
which allows us to map the size and location of Foxl1-expressing cells 
(Fig. 1a). Figure 1b shows an example of duodenal crypts, where Foxl1- 
expressing cells, identified both by nuclear FOXLI staining (white) 
and membraneous GFP expression (green), are in close contact with 
the entire crypt base, identified by EPCAM staining (red). FOXL1T 
cells are large, flat cells with cytoplasmic processes in excess of 100 jm. 
Cells with these properties, that is, very large mesenchymal cells with 
extended cell bodies, were identified previously by electron microscopy 
and termed ‘telocytes; with the long processes termed ‘telopodes’*?. 

Intestinal telocytes express platelet-derived growth factor receptor 
alpha (PDGFRa)!*. PDGFRa staining overlapped with Foxl1-cre- 
induced GFP expression, further supporting the notion that these cells 
are intestinal telocytes (Fig. 1c, d), although PDGFRa is also present 
in numerous cells deeper in the submucosa that do not express Foxl1. 
Immunoelectron microscopy confirmed that FOXL1* cells form long 
telopodes that encompass all crypt cells, and are separated from the 
epithelium by sub-micrometre distances (Fig. le—g). 


In order to gain a better understanding of the three-dimensional 
nature of the FOXL1* telocyte network, we performed confocal imag- 
ing of immunofluorescence-stained jejunum following tissue clearing. 
FOXL1* PDGFRa‘t telocytes form a plexus that supports the entire 
epithelium, visualized by EPCAM staining (Fig. 1h and Supplementary 
Video). Thus, FOXLI* PDGFRat positive telocytes maintain a sub- 
epithelial sheath that is juxtaposed to the epithelium, and therefore in 
an ideal position to act as a niche to provide signals to the stem and 
progenitor cells in the intestinal crypt. 

For the molecular characterization of FOXL1* telocytes, we sorted 
cells based on Foxl1-cre;Rosa26-mTmG activity (Extended Data Fig. 1), 
performed RNA sequencing analysis (RNA-seq), and compared the 
expression profiles to those of FOXL1~ mesenchymal cells, LGR5* 
stem cells (sorted by Lgr5-eGFP"'8"!> expression) and epithelial entero- 
cytes (Fig. 2a). Note that while FOXL1* cells express high levels of 
Pdgfra and Cd34, these two genes are also active in FOXL1~ cells, 
confirming our observation from immunostaining and genetic line- 
age tracing that FOXL1* cells constitute a small subset of PDGFRat 
cells. FOXL1* telocytes express high levels of multiple regulators—both 
activators and repressors—of key signalling pathways, such as Wnt, 
SHH, Bmp and TGF-§ (Fig. 2b-g). While the Wnt ligands WNT2B 
and WNT5A are specifically expressed in FOXL1* telocytes, the Wnt 
signalling enhancer and LGRS ligand R-spondin3 is produced by both 
FOXLI* telocytes and FOXL1~ mesenchymal cells (Fig. 2c; Extended 
Data Table 1). Notably, FOXL1* telocytes also express the Wnt inhib- 
itors DKK3 and SFRP1. Similarly, FOXL1* telocytes express BMP4, 
BMP5, BMP6 and BMP7 as well as the Bmp inhibitors chordin-likel 
and gremlin] (Fig. 2d). 

The results of the expression profiling of FOXL1* telocytes summa- 
rized above present an apparent paradox. How can FOXLI* telocytes 
provide Wnt ligands to activate Wnt signalling in intestinal crypts if 
they also produce Wnt pathway inhibitors? We speculated that there 
might be a regional differentiation of the transcriptome of FOXL1* 
telocytes according to their position along the crypt-—villus axis. In 
order to test this hypothesis, we performed single-molecule fluores- 
cence in situ hybridization (smFISH) to detect mRNA encoding acti- 
vators (WNT2B, WNT5A and R-spondin 3) and inhibitors (SFRP1 
and DKK3) of the Wnt signalling pathway, and BMP5, which signals 
via activation of the Smad signalling cascade. We labelled telocytes 
by immunostaining for PDGFRo, as FOXL] is a nuclear protein and 
thus does not show the full extent of the cells. As shown in Fig. 3a, telo- 
cytes express both Wnt2b, an activator of canonical Wnt signalling, and 
Want5a, which acts via the planar cell polarity pathway. However, the 
localization of the two mRNAs is clearly distinct, with Wnt2b enriched 
at the bottom of the crypts where LGR5* stem cells are situated, and 
Wnt5a mRNA present near the crypt—villus junction. Likewise, there is 
a striking pattern of co-expression of Wnt2b with Sfrp1, which encodes 
a secreted frizzled protein that acts as a decoy receptor to limit Wnt 
signalling at the bottom of the crypt; whereas towards the crypt-—villus 
junction, Sfrp1 mRNA predominates, as the density of Wnt2b mRNA 
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Fig. 1 | FOXL1* cells are telocytes and co-express PDGFRa. a, Schema 
for labelling FOXL1* cells with GFP. b, Foxl1-cre-driven GFP expression 
is restricted to pericryptal mesenchymal telocytes. FOXL1 (white, *), GFP 
(green) and EPCAM (red) immunofluorescence in cleared whole mouse 
duodenum. c, d, GFP (green) and PDGFRa (red) immunofluorescence 

in the duodenum (c) and colon (d). e-g, Immunoelectron microscopy 
showing GFP in duodenal crypt (transverse section) of a Foxl1-cre;Rosa26- 
mTmG mouse. f, Inset showing the nucleus and the telopode of a telocyte. 
n, nucleus; Tp, telopode. g, Inset showing contact between two telopodes. 
h, Confocal imaging of cleared whole small intestine showing expression 
of PDGFRa (green) and EPCAM (red). Experiments were repeated at least 
three times with similar results. Scale bars, 10 jm. 
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Fig. 2 | Telocytes express key signalling molecules. RNA-seq analysis of 


FOXLI* cells (1 =3 mice) compared to FOXL1~ mesenchymal cells (Mes., 
n=3 mice), LGR5* stem cells (n =2 mice) and differentiated enterocytes 
(Diff., n = 2 mice). a, Heat map showing hierarchical clustering. Only 
genes with more than tenfold difference in expression level in pairwise 
comparisons and a false discovery rate (FDR) of less than 2% were used for 
the analysis. Values are shown as normalized Z score +s.d. b-g, Selected 
markers showing differential expression: Foxll, Pdgfra and Cd34 (telocyte 
markers), Lgr5 and Slc5a1 (b), WNT pathway (c), BMP pathway (d), SHH 
pathway (e), growth factors (f) and TGF-6 pathway (g). Data are shown as 
fragments per kilobase million (FPKM). Error bars indicate s.e.m. 


molecules decreases (Fig. 3f, j). Rspo3, which encodes an important 
co-activator of the Wnt pathway, is present in telocytes only at the bottom 
of the crypt, and is absent further up the crypt-villus axis (Fig. 3c). By 
contrast, transcripts for Bmp5 and the Wnt signalling inhibitor Dkk3 
are present at higher levels near the crypt-villus junction than the crypt 
base (Fig. 3d). 

Next, we quantified the density of the transcripts for these signalling 
mediators in telocytes, subdivided by their position along the crypt- 
villus axis (Fig. 3e-k). Expression of Wnt2b and Rpso3 was much 
higher in telocytes at the crypt base than at the villus tip (Fig. 3f, g), 
while expression of Dkk3 was increased towards the villus tip (Fig. 3k) 
and that of Sfrp1 expression was only marginally dependent on loca- 
tion (Fig. 3j). Expression of Bmp5 and WntSa was elevated towards 
the crypt-villus junction and villus tip, suggesting that the planar cell 
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Fig. 3 | Telocytes compartmentalize signalling molecule mRNAs for 
localized signalling. a-d, Cryosections of mouse duodenum stained 
for PDGFRa (blue), DNA (grey) and hybridized with smFISH probes 

as indicated. The numbered insets are shown at higher magnified. e, For 
mRNA quantification, fluorescence signals were counted per defined 


polarity pathway and BMP signalling are more active away from the 
crypt. These data indicate that telocytes compartmentalize transcripts 
depending upon their position along the crypt-—villus axis to locally 
promote proliferation by maintaining a higher Wnt:SFRP ratio near 
the crypt base than at the villus epithelium. To obtain independent 
validation of transcriptome specialization among FOXL1* telocytes, 
we also performed single-cell reverse transcription with quantitative 


Control 


Duodenum 


Duodenum 


Fig. 4 | The intestinal epithelium depends on Wnts secreted from 
FOXLI* telocytes. a—c, Control and PorcnA mutant duodenum, one and 
three days after tamoxifen injection. Experiments were repeated at least 
three times with similar results. d, e, Duodenal crypt depth (control, n =3 
mice; PorcnA 24h, n =3 mice) and villus length (control, n =9 mice; 
PorcnA 24h, n=3 mice; PorcnA 72h, n=7 mice; ***P <0.001, unpaired 
two-tailed t-test). f-h, Control and PorcnA mutant colon, one and three 
days after tamoxifen injection. Experiments were repeated at least three 
times with similar results. i, Colonic crypt depth (control, n =7 mice; 
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area along the crypt—villus axis. f-k, mRNA molecules per cubic 1m 
along the crypt-villus axis. Experiments were repeated three times with 
similar results. Scale bars, 10 j1m. Black bars represent the mean value. 
**P < 0.001, *P <0.1; ordinary one-way ANOVA multiple comparison 
with nonparametric Kruskal-Wallis test. NS, not significant. 


PCR (qRT-PCR) for key genes on fluorescence-activated cell sorting 
(FACS)-sorted Foxl1-cre;Rosa26-mImG GFP-positive cells and iden- 
tified clearly distinct expression profile subtypes among FOXL1* cells 
(Extended Data Fig. 2). Future studies will be needed to determine 
whether these are temporal states that are dependent on telocyte 
migration along the radial crypt-villus axis, or whether they represent 
permanent telocyte subtypes. 
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PorcnA 72h, n=3 mice; PorcnA 72h, n=7 mice; ***P <0.001, unpaired 
two-tailed t-test). j-1, n—p, EdU incorporation (red) in the epithelium 
(EPCAM, green) of duodenum and colon. Experiments were repeated at 
least three times with similar results. m, Number of proliferating cells per 
duodenal crypt. n=3 mice per group; ***P <0.001, unpaired two-tailed 
t-test. q, Number of proliferating cells per colonic crypt. n =3 mice per 
group; ***P <0.001, unpaired two tailed t-test. Centre lines, mean; error 
bars, s.d. Scale bars, 100 1m. 
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Fig. 5 | FOXL1* telocytes provide essential Wnt ligands to the intestinal 
stem cell compartment. a, b, Wnt pathway activation analysed by 
immunohistochemistry for }-catenin (brown). The indicated regions are 
shown at high magnification in the insets. c-j, Immunofluorescence of 

the Wnt targets cyclin D1 and SOX9. k-n, Stem cell markers Olfm4 (in 
situ hybridization) and CD44 (immunohistochemistry), one day post 
induction in PorcnA duodenum. Insets show a higher magnification of the 


To address the question of whether Wnt production by FOXL1* 
telocytes is important for intestinal stem cell activity, we derived a 
Foxl1-creERT2 mouse line to enable temporally controlled gene abla- 
tion in this cell type (Extended Data Fig. 3a). To confirm that the 
Foxl1-creERT2 transgene, which is based on the same 170-kb bacterial 
artificial chromosome (BAC) transgene as we employed previously to 
obtain faithful reproduction of the Foxl1 gene expression pattern’, 
is active in subepithelial telocytes, we crossed these mice to Rosa26- 
mTmG reporter mice (Extended Data Fig. 3b, c) and treated them with 
tamoxifen to induce Cre activity. In both small and large bowel, Foxl1- 
creERT2 was active in subepithelial telocytes as intended. 

Next, we crossed Foxll-creERT2 mice with mice carrying a floxed 
allele of the X-linked Porcn gene!>, to obtain Porcn!°*?/Y; Foxl1-creERT2 
mice (hereafter referred to as PorcnA, Extended Data Fig. 3d). This 
model enables conditional ablation of all Wnt secretion from FOXL1- 
expressing telocytes. We analysed PorcnA mice after 24 and 72h of 
tamoxifen treatment. Notably, while epithelial morphology was nor- 
mal at the 24-h time point (Fig. 4a, b, f, g), stem and progenitor cell 
proliferation in duodenal and colonic crypts was already markedly 
reduced, as determined by labelling cells in S-phase with a short pulse 
of the thymidine analogue ethynyl deoxyuridine (EdU) (Fig. 4j, k, m, 
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indicated regions. 0, p, Expression of Lgr5 and Rspo3 mRNA as detected 
by smFISH, three day post induction in PorcnA duodenum. *denotes a 
staining artefact. q, r, Immunofluorescence for EPCAM and PDGRFa 
in duodenum of control and PorcnA mice. Experiments were repeated 
at least three times with similar results. In a—n, scale bars, 100 1m (main 
panel), 251m (inset). 


n, 0, r). Consequently, duodenal crypt depth was significantly reduced 
even after one day of porcupine ablation (Fig. 4d). After three days of 
tamoxifen treatment, PorcnA mice exhibited marked changes in the 
epithelial architecture in both the small and large intestine (Fig. 4c, h), 
which we attribute to lack of resupply of the epithelium with newly 
differentiated cells from the crypts. Epithelial proliferation rates were 
reduced even further at this time point (Fig. 41, m, p, r). As a result, 
the length of the villi was reduced in the duodenum (Fig. 4c, e), 
and epithelial cells appeared abnormal in colonic crypts (Fig. 4h), 
which also exhibited a significant reduction in depth (Fig. 4i). Thus, 
eliminating Wnt secretion from FOXL1* telocytes results in a large 
and highly significant reduction in epithelial cell proliferation in the 
small and large intestine. 

In order to test directly whether FOXL1T telocytes are a relevant 
source of Wnt signals to the intestinal crypt, we stained control and 
PorcnA intestines for the presence of nuclear B-catenin, a marker of 
active Wnt signalling, as well as cyclin D1 and SOX9, proteins encoded 
by direct target genes of 3-catenin. As shown in Fig. 5a, b, while the 
control duodenum showed the typical distribution of 3-catenin protein 
at the adherens junctions of all epithelial cells and in the nuclei of cells 
with active Wnt signalling in the crypts, the PorcnA intestine had lost 
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nuclear B-catenin protein expression. Likewise, expression of cyclin 
D1 and SOX9 was reduced markedly in the duodenum and colon of 
PorcnA mice (Fig. 5c-j). Indeed, we found that expression of Olfm4, 
Lgr5 and CD44, all markers of stem cells in the crypt base!®!”, was 
lost as early as 24h following porcupine ablation (Fig. 5k—p). Taken 
together, these findings demonstrate that FOXL” telocytes are 
necessary for Wnt pathway activation and proliferation in intestinal 
stem and progenitor cells. Recently, systemic administration of ligand- 
specific pharmacological perturbations showed that Wnt and 
R-spondin ligands represent non-equivalent ligand families that 
cooperate to maintain intestinal stem cells, with the former having a 
‘priming’ effect and the latter driving stem-cell expansion!®. Therefore, 
we evaluated whether Wnt production by FOXL* telocytes might indi- 
rectly influence R-spondin gene expression. Remarkably, we found 
reduced Rspo3 mRNA levels in telocytes in the PorcnA intestine 
(Fig. 50, p). Telocytes were still present even after 72h of Porcn gene 
ablation (Fig. 5q, r), suggesting the possibility that reciprocal signalling 
from the epithelial compartment might be responsible for maintain- 
ing Rspo3 gene expression. In sum, sub-epithelial FOXL1* telocytes 
contribute important Wnt signals to intestinal crypt cells to main 
their proliferation and allow for homeostatic renewal of the intestinal 
epithelium. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 


Foxl1-creERT2 mice. Foxll-creERT2 mice for tamoxifen-inducible gene ablation 
of loxP-flanked targets were derived by introducing a creERT2 cassette, cloned 
from a K18iresCreER-T2 plasmid’? (Addgene, #44580), into the coding region of 
the mouse Forkhead box 11 (Foxl1) gene in the BAC RP23-446J14 by BAC recom- 
bineering as described previously’”. The targeting primers were: forward, ctccccg 
giggagcegagagectgctgtcgccatgagccacctcttcagtaagccagcagaagcaggstac; reverse, 
agcactgtggcctcgggaacctgcagaggaaaataagaaaggaaggeaatctaggaatctagtgctcccaaages 
tacctge. 

The resulting Foxl1-creERT2 BAC DNA was linearized and microinjected into 
fertilized oocytes. The positive transgenic founders were identified by genomic 
PCR, and the fidelity of transgene expression was tested by crossing Foxl1-creERT2 
transgenic mice to the Rosa26-mTmG reporter line’? and visualizing fluorescent 
protein expression in tissue sections (Extended Data Fig. 3b, c). 
Foxl-CreERT2;PorcnA, Foxll-creERT2;Rosa26-mTmG and Foxll-cre; 
Rosa26-mTmG mice. Foxl1-creERT2 mice were crossed with Porcn-ex3-7Neo- 
flox!°, Rosa26-membrane-targeted dimer tomato protein or membrane- 
targeted green fluorescent protein (mG)"’, or Rosa26-YFP”° mice to obtain Foxl1- 
CreERT2;PorcnA, Foxll-creERT2;Rosa26-mTmG, or Foxl1-creERT2;Rosa26-YFP 
mice. Non-inducible Foxl1-cre mice'® were also crossed to Rosa26-YFP and 
Rosa26-mTmG reporter lines (Jackson Laboratories). The animal experiments were 
performed according to a protocol approved by the University of Pennsylvania 
Institutional Animal Care and Use Committee. 

Tamoxifen treatment. For Foxll-creERT2;PorcnA and control (Foxl1- 
creERT2;Rosa26- YFP or Foxll-CreERT2;Rosa26mTmG) mice, tamoxifen (Sigma- 
Aldrich, Cas#10540-29-1) was dissolved in corn oil at a concentration of 30 mg/ml 
by shaking overnight at 37 °C and administered intraperitoneally at 75 mg tamox- 
ifen per kg body weight. Mice were injected on three consecutive days and killed 
on day 4, corresponding to 72h tamoxifen treatment, or day 2, corresponding to 
24h tamoxifen treatment. 

EdU treatment. For EdU treatment, 101M EdU solution was injected intraperi- 
toneally two hours before mice were killed, and EdU was detected according to 
the manufacturer's protocol (Click-iT Plus EdU Alexa Fluor 555 imaging kit C 
10638, Molecular Probes). 

FACS, RNA isolation and sequencing library preparation. Isolation of FOXL1*+ 
cells using FACS was performed using Foxl1-cre;Rosa26-YFP and Foxl1-cre;Rosa26- 
mTmG mice. Small intestines were dissected and washed thoroughly with Hank's 
balanced salt solution (HBSS) and were incubated in 5mM EDTA in HBSS for 
10 min at 4°C. Intestinal villi were scraped off using a coverslip and the remain- 
ing tissue was cut into small pieces and incubated in 5mM EDTA and HBSS on 
ice for 10 min while pipetting to completely remove the remaining epithelium. 
After vigorous washes, the remaining mesenchymal fraction was incubated with 
6mg/ml Dispase II/0.05% trypsin solution (Sigma-Aldrich, 04942078001) sup- 
plemented with 1 U/ml DNasel (Invitrogen) at 37°C, until the solution became 
cloudy and the mesenchyme was dissociated (15 min). Supernatant was filtered 
through a 70-.m strainer, centrifuged and resuspended with anti CD45-APC and 
anti EPCAM-APC antibodies (1:300, Biosciences, and 1:100, BioLegend, respec- 
tively) in 10%FBS/5mM EDTA/HBSS for 45 min at room temperature to exclude 
immune and epithelial cells from sorting. A single-cell suspension was obtained by 
filtering the supernatant through a 40-j1m strainer and washing with 5mM EDTA 
in HBSS, before cell sorting using a BD influx instrument (BD Biosciences). For 
RNA isolation, GFP* and Tomato* cells for FOXL1* and FOXL1~ mesenchymal 
cells, respectively, were lysed and total RNA was isolated by column purification 
(Absolutely RNA Nanoprep Kit, Agilent Technologies). cDNA was synthesized and 
amplified using Ribo-SPIA technology (Ovation RNA-Seq System V2, NuGEN 
7102). cDNA was sonicated and an mRNA sequencing library was prepared using 
the NEBNext RNA library prep kit (New England BioLabs). All libraries were 
sequenced on the Illumina Hiseq 2500 with 100-bp single-end reads. Median read 
depth was 27 million reads per sample. 

RNA-seq analysis. RNA-seq was aligned to mm9 using RUM”! with default param- 
eters. For comparison LGRS* and differentiated enterocyte data sets were used”. 
Differential expression was performed using edgeR”’. For heatmap analysis, dif- 
ferentially expressed genes were selected by comparisons of FOXL1* population 
with the other populations with 1% FDR cut-off and fold change of two. Genes 
were ranked by their relative fold change. 

Single cell sorting and qRT-PCR analysis. Single Foxl1* cells were sorted into 
96-well plates with 5 ul lysis buffer. Cell lysis, reverse transcription, gene-specific 
preamplification (22 cycles) and downstream quantitative PCR was performed 
according to the manufacturer’s instructions (Fluidigm Biomark HD, 68000088 
K1). A total of 179 cells and 39 genes were analysed in the experiment. Outliers 
were removed using the identifyOutliers function from SINGuLAR software with 
detection limit set to 24 cycles (Fluidigm PN100-5066 F1). The expression value 
(Log Ex) for each gene was computed according to the SINGuLAR user manual 
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(Fluidigm PN100-5066 F1). For hierarchal clustering, genes were clustered by the 
Pearson correlation method; cells were measured by their Euclidean distance and 
clustered by the ward.D method, all implemented by the hclust function in R. To 
assess the stability of clusters, we applied the clusterboot algorithm from the fpc 
package to calculate Jaccard coefficient based on 1,000 bootstrapping”. 
Transmission electron microscopy. Foxl1cre;Rosa26-mTmG and C57BL/6 con- 
trol mice were used for immunoelectron microscopy. The ileum was dissected 
and fixed for 2h at 4°C with paraformaldehyde-lysine-periodate (PLP) fixative”. 
After washing with phosphate-buffered saline, the specimens were sectioned 
with a vibratome to 70|1m thickness and blocked with protein blocking agent 
(StartingBlock T20 PBS Blocking Buffer, Thermo Scientific #37539) for 30 min 
at room temperature, and then incubated in the presence of an anti-GFP anti- 
body (1:100, Abcam AB6673) at 4°C overnight. After washing, the samples were 
incubated in the presence of horseradish peroxidase (HRP) biotinylated anti-goat 
secondary antibody, and the signal was developed using 3,3-diaminobenzidine 
tetrahydrochloride (DAB) as a substrate. The ultra-thin sections were examined 
under a JEOL 1010 electron microscope fitted with a Hamamatsu digital camera 
and AMT Advantage image capture software. 

Histology. For haematoxylin and eosin staining, paraffin tissue sections were 
rehydrated, incubated in haematoxylin for 5 min, rinsed in water, dipped quickly 
in 1% acid ethanol and washed in water. Tissues then were immersed in 0.2% 
ammonium hydroxide, rinsed in water, dipped in eosin for 40s and briefly rinsed 
in water before dehydration and mounting. 

Imunofluorescence and immunohistochemistry. Mouse intestines were rinsed 
in PBS and fixed with 4% paraformaldehyde overnight, rinsed in PBS, and either 
dehydrated for paraffin embedding or immersed in 30% sucrose for 4h at 4°C, 
embedded in optimum cutting temperature compound (OCT, Tissue-Tek), and 
frozen for cryosectioning. Antigen retrieval was performed using citrate buffer 
pH 6.0 with a pressure cooker (PickCell Laboratories, Agoura Hills, CA). FOXL1 
staining required the use of cryosections, antigen retrieval, and amplification of 
signal using tyramide (TSA systems, PerkinElmer) as previously described®, CD44 
immunohistochemistry was performed as described”®. Antibodies used were as 
follows: rabbit SOX9 (1:300; Millipore AB5535), guinea pig FOXL1 (1:1,500), 
goat GFP (1:200; Abcam AB6673), mouse E-cadherin (1:250; BD Transduction 
610182), mouse $-catenin (1:200; BD Transduction 610153), rabbit EPCAM (1:100; 
Abcam AB71916), goat PDGFRa (1:100; R&D AF1062), rat CD44var (v6) (1:100; 
Biosciences BMS145). Cy2-, Cy3- and Cy5-conjugated donkey secondary antibod- 
ies were purchased from Jackson ImmunoResearch Laboratories. For immuno- 
histochemistry, horseradish peroxidase-conjugated antibodies were incubated for 
2h at room temperature and 3,3-diaminobenzidine tetrahydrochloride (DAB) was 
used as a substrate to develop the signal. For the studies requiring quantification, 
we used n= 3 per genotype or condition based on the severity of the phenotype 
observed in pilot experiments. Mice were assigned to condition by genotype and 
were age-matched from the same colony. The investigator counting the number 
of EdU-positive cells or quantifying villus length or crypt depth was blinded to 
the genotype of the mice. 

Clearing of adult mouse intestine. Foxl1-cre; Rosa26-mTmG and C57BL/6 adult 
mice were anaesthetized, intestine was dissected and fixed in 4%PFA for 24h at 4°C. 
Following washes with PBS for another 24h at 4°C, the intestine was incubated 
in hydrogel solution for 24h at 4°C”’. After 3h at 37°C, the hydrogel-embedded 
intestine was placed in an X-CLARITY ETC chamber (LOGOS Biosystems) 
for electrophoretic tissue clearing for 7h. The cleared intestine was immuno- 
stained with goat anti-GFP (1:100; Abcam) or PDGFRa (1:100; R&D) and APC- 
conjugated anti-mouse EPCAM (1:100; Biolegend) or rabbit anti-EPCAM (1:100; 
Abcam). For FOXL1 antibody staining, following clearing, antigen retrieval was 
performed (as described above) in citrate buffer. Primary and secondary antibodies 
were incubated for 48h at 4°C each. The stained intestine was placed en bloc on an 
image slide using 1-mm depth adhesive silicone isolator, mounted in X-CLARITY 
mounting solution and imaged using confocal scanning. Z-stacks projections were 
compiled using Volocity software. 

Single-molecule FISH and immunofluorescence. Libraries of 48 probes each 
were designed against the mouse Wnt2b, Wnt5a, Sfrp1, Rspo3, Bmp5 and Dkk3 
mRNAs (Stellaris RNA FISH probes, Biosearch Technologies). Hybridization 
conditions and imaging were as described previously”*”’, except for the addition 
of immunofluorescence detection of PDGFRa. For immunofluorescence, a goat 
anti-PDGFRa antibody was diluted in hybridization buffer (1:100), added to the 
FISH probes and incubated overnight at 30°C. Secondary antibody Cy2-conjugated 
donkey anti-goat (1:200) was added to glucose oxidase (GLOX) buffer for 20 min 
in room temperature. smFISH imaging was performed on a Nikon-Ti-E inverted 
fluorescence microscope with a 100 x oil-immersion objective and a Photometrics 
Pixis 1024 CCD camera using MetaMorph software as previously described”*°. 
Single-cell segmentation and automatic transcript quantification were per- 
formed in a custom Matlab program (MATLAB release 2013b, Mathworks) using 
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TransQuant software*!. Images were filtered with a Laplacian of Gaussian filter of 
size 15 pixels and standard deviation of 1.5 pixels*”. 

Olfm4 mRNA in-situ hybridization. In situ hybridization for Olfm4 mRNA was 
performed using the RNAscope kit (Advanced Cell Diagnostics) according to the 
manufacturer's instructions. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. RNA-seq data that support the findings of this study have been 
deposited in the NCBI GEO under accession code GSE94072. Source data for 
Figs. 2, 3 and 4 are provided in the online version of the paper. 
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Extended Data Fig. 1 | Fluorescence-activated cell sorting plots. 
a-d, Representative FACS plots of mesenchymal cells isolated from 
Foxl1-cre;Rosa26-mTmG (c, d) as compared to control Rosa26-mTmG 
(a, b) mice, showing sorting strategy. In a and c, Allophycocyanin* 
(APC*), CD45* and EPCAM*-labelled cells were gated out to exclude 
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immune and epithelial cell contamination. In d, FOXL1*, GFP* and 
FOXL1~ Tomato‘ cells were gated and sorted based on fluorescence 
activity as compared to the negative control (b). Experiments were 
repeated at least three times with similar results. 
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population is clustered into three main groups. Jaccard coefficients for 
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Extended Data Fig. 3 | Derivation of Foxl1-creERT2;PorcnA mice. 

a, Schema for the generation of Foxl1-creERT2 mice using BAC 
recombineering. The coding sequence of exon 1 of Foxl1 was targeted by 
the sequence of creERT2. FRT, flipase recognition target; Flp, flipase; LA, 
left homology arm; RA, right homology arm. b, c, Tamoxifen induction 
of Foxll-creERT2;Rosa26-mTmG-driven expression of membrane-bound 
GFP to mesenchymal telocytes in the duodenum (b) and colon (c). 
Immunofluorescence staining for GFP (green), EPCAM (red). d, Schema 
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for the generation of Foxll-creERT2;PorcnA mice. Foxl1-creERT2 mice 
were crossed with mice carrying loxP sites flanking exons 3-7 of the 
X-linked porcupine homologue (Porcn) gene. e, Foxl1-creERT2;Rosa26- 
mTmG (control, n=5 mice) and Foxll-creERT2;PorcnA (PorcnA, n=8 
mice) male mice were treated with tamoxifen for three consecutive days to 
induce Cre expression and weighed every day. The slope of the weight loss 
was significantly different in PorcnA mice as compared to control mice 
(*P =0.0107, two-tailed linear regression analysis). 
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Extended Data Table 1 | Wnt pathway gene expression levels in different cell populations as assessed by RNA-seq (mean of 2-3 samples 
each in FPKM) 


Wnt pathway 


Gene name Foxl1* Expression level Mes Expression level Lgr5 Expression level Diff Expression level 
mean (FPKM) mean (FPKM) mean (FPKM) mean (FPKM) 


Wnt2 0.9742 0 0.1736 0 
Wnt2b 9.5831 0.1205 0.1559 0.1191 
Wnt3 0.3438 0 1.0915 0.0909 
Wnt4 1.7951 0.0102 0.9198 
Wnt5a 4.9932 0.1219 0.0786 
Wnt5b 0.8935 0.5514 0.3542 
Wnt6 0.0866 0.0824 0.3045 
Wnt9a 0.5072 0.0505 0.0553 
Wisp1 2.4082 0.0286 0 
Rspo2 0.4112 
Rspo3 2.1902 : 0.0141 

Lgr4 7.5384 48.6203 9.8020 
1.9220 19.3406 0.0408 
0.2264 
Znrf1 4.2325 0 3.6232 0.8523 


Znrf2 2.2594 5.0671 12.7731 7.7214 
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Znrf3 0.9130 0.1328 1.9908 0.1333 
Dkk2 1.8779 0 


Dkk3 36.5673 22.1995 0.0017 0.0077 
Wif1 1.2579 0 0.0125 0.0844 


Sfrp1 47.2422 0.6198 0.0722 0.0034 
Fzd2 2.50329 1.9856 0.0100 
Fzd4 4.5451 0.2025 0.9071 
Fzd6 0.5785 2.7795 0.2181 
Fzd7 1.1146 16.7951 2.5999 
Fzd8 0.6245 0 0.7510 0.3280 


Lrp1 11.4018 0.0144 6.4567 12.5752 
Lrp6 6.0929 0.0013 9.5234 4.2674 
Tcf4 15.0228 8.338 6.0325 0.9127 


Tcf15 1.71113 0 0.0090 0 
Tcf19 1.456 8.768 13.6231 0.7681 


Tcf21 19.6602 0.0065 0 0 


Tcf7I1 1.8914 0 0.1070 0.0623 
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De novo formation of the biliary system by TGFB- 
mediated hepatocyte transdifferentiation 


Johanna R. Schaub”, Kari A. Huppert”, Simone N. T. Kurial?-", Bernadette Y. Hsu!*, Ashley E. Cast’, Bryan Donnelly‘, 
Rebekah A. Karns’, Feng Chen!, Milad Rezvani!, Hubert Y. Luu, Aras N. Mattis®’, Anne-Laure Rougemont’, Philip Rosenthal”, 


Stacey S. Huppert?!°!5* & Holger Willenbring!71)8+ 


Transdifferentiation is a complete and stable change in cell 
identity that serves as an alternative to stem-cell-mediated organ 
regeneration. In adult mammals, findings of transdifferentiation 
have been limited to the replenishment of cells lost from preexisting 
structures, in the presence of a fully developed scaffold and niche’. 
Here we show that transdifferentiation of hepatocytes in the mouse 
liver can build a structure that failed to form in development—the 
biliary system in a mouse model that mimics the hepatic phenotype 
of human Alagille syndrome (ALGS)”. In these mice, hepatocytes 
convert into mature cholangiocytes and form bile ducts that are 
effective in draining bile and persist after the cholestatic liver 
injury is reversed, consistent with transdifferentiation. These 
findings redefine hepatocyte plasticity, which appeared to be 
limited to metaplasia, that is, incomplete and transient biliary 
differentiation as an adaptation to cell injury, based on previous 
studies in mice with a fully developed biliary system*°. In contrast 
to bile duct development”~’, we show that de novo bile duct 
formation by hepatocyte transdifferentiation is independent of 
NOTCH signalling. We identify TGF signalling as the driver of 
this compensatory mechanism and show that it is active in some 
patients with ALGS. Furthermore, we show that TGF® signalling 
can be targeted to enhance the formation of the biliary system from 
hepatocytes, and that the transdifferentiation-inducing signals 
and remodelling capacity of the bile-duct-deficient liver can be 
harnessed with transplanted hepatocytes. Our results define the 
regenerative potential of mammalian transdifferentiation and reveal 
opportunities for the treatment of ALGS and other cholestatic liver 
diseases. 

In regenerating organs of adult mammals, differentiated cells can 
replenish other types of differentiated cells by transdifferentiation, as 
in the pancreatic islet'®, gastric gland'', lung alveolus’? and intestinal 
crypt’?. Whether mammalian transdifferentiation can build these 
or other structures de novo is unknown. In the liver, hepatocytes 
can undergo biliary differentiation to form reactive ductules both in 
humans'*-!” and animals with cholestatic liver injury*°'*?!. However, 
hepatocyte-derived biliary cells exhibit incomplete biliary and residual 
hepatocyte differentiation, that is, they are not mature cholangiocytes, 
and revert back to their original identity after the injury is reversed*°. 
Moreover, hepatocyte-derived ductules do not contribute to bile 
drainage°. These findings are consistent with metaplasia, but not with 
transdifferentiation, and call into question the functional importance 
of hepatocyte plasticity. However, all studies of hepatocyte plasticity 
published so far used animals with a fully developed biliary system in 
which residual cholangiocytes are available to regenerate injured bile 


ducts, probably leading to insufficient pressure for transdifferentiation. 
To determine the full extent of hepatocyte plasticity, we used mice that 
lack the intrahepatic biliary system. 

Specifically, we used mice that mimic the hepatic phenotype of 
ALGS—a human disease caused by impaired NOTCH signalling** *4— 
generated by deletion of floxed alleles of the NOTCH signalling effector 
RBPJ and, to prevent compensation», the transcription factor HNF6 
(also known as ONECUT1) in embryonic liver progenitors using Cre 
expressed from an albumin (Alb) promoter”. These Alb-cre*/~ Rbpj! 
Hnf6/S mice are severely cholestatic because they lack peripheral bile 
ducts (pBDs), the branches of the intrahepatic biliary tree, at birth; 
however, more than 90% of the mice survive and form pBDs and a 
functional biliary tree by postnatal day (P) 12076 (Fig. 1a). Although 
Alb-cre*!~ RbpjHnf6"! mice have hilar bile ducts (hBDs), the trunk 
of the biliary tree, we hypothesized that the new pBDs originate from 
hepatocytes because the livers of Alb-cre+!~ Rbpj/Hnf6" mice contain 
hybrid cells that express hepatocyte and biliary markers””®. To test this 
hypothesis, we developed Cre-independent hepatocyte fate tracing. For 
this, we activated the Flp-reporter (GFP) in R26ZG*’* mice specifically 
in hepatocytes by intravenous injection of an adeno-associated virus 
(AAV) serotype 8 vector expressing Flp from the transthyretin (Ttr) 
promoter (AAV8-Ttr-Flp) (Fig. la and Extended Data Fig. 1a-g). 

To trace hepatocytes during pBD formation, we intravenously 
injected Alb-cre+'—RbpjHnfo"!R26ZG*'* mice with AAV8-Ttr-Flp 
at P17, which is before cells expressing biliary markers emerge in the 
liver periphery””®. At P120, the newly formed pBDs, identified by 
wide-spectrum (ws) cytokeratin (CK) immunofluorescence, were GFP 
positive, indicating hepatocyte origin (Fig. 1a, b). By contrast, pBDs 
that formed normally during liver development in Cre-negative litter- 
mates were GFP negative (Extended Data Fig. 2a, b). In many pBDs 
of Alb-cre*!~ RbpjHnf!'R26ZG*'* mice, all cells were GFP positive, 
whereas the overall labelling efficiency of peripheral cholangiocytes 
was 39.2 +7.2% (mean +s.e.m.); however, this number correlated well 
with the hepatocyte labelling efficiency (36.6 +4.8%; P= 0.48, paired 
two-sided Student's t-test) (Extended Data Fig. 2c), indicating that 
all peripheral cholangiocytes originated from hepatocytes. We also 
observed hepatocyte-derived peripheral cholangiocytes in mice in 
which hepatocytes were labelled after weaning (P39) (Extended Data 
Fig. 2d, e). These results show that hepatocytes can form pBDs de novo. 

To investigate the function of the hepatocyte-derived pBDs (HpBDs), 
we determined whether they are contiguous with the extrahepatic 
biliary system. At P30, retrograde ink injection into the common bile 
duct filled only hBDs, reflecting a severely truncated biliary tree that 
lacked pBDs; however, ink injection at >P120 revealed a biliary tree of 
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Fig. 1 | Hepatocytes can convert into peripheral cholangiocytes 

and form pBDs contiguous with preexisting hBDs. a, De novo pBD 
formation and hepatocyte fate tracing in Alb-cre*!" Rbpj/‘HnfO/R26ZG*'+ 
mice. Cells identified by dolichos biflorus agglutinin (DBA) lectin labelling 
and wsCK and GFP immunofluorescence. b, Immunofluorescence of 
hepatocyte-fate-traced P120 Alb-cre*!" Rbpj/Hnfo/R26ZG*'* mouse 

liver (n =7). ¢, Biliary tree visualized by retrograde ink injection into the 
common bile duct of P30 (n= 6), P120-P138 (n=6) and >P334 (n=6) 
Alb-cre*'~ RbpjHnfo mice, and P30 (n =3) and P120-P138 (n=5) 


normal dimensions, demonstrating that HpBDs are connected to hBDs 
(Fig. 1c). Three-dimensional (3D) reconstruction of confocal imaging 
confirmed that HpBDs form contiguous lumens with dolichos biflorus 
agglutinin (DBA)-labelled hBDs (Fig. 1d and Supplementary Video 1). 
Accordingly, HpBDs were effective in draining bile, as evidenced by 
normalization of serum markers of cholestasis (total bilirubin and 
alkaline phosphatase levels; Extended Data Fig. 3a—c) and hepatocyte 
injury (alanine aminotransferase and aspartate aminotransferase levels; 
Extended Data Fig. 3d, e) and resolution of cholestasis-induced liver 
fibrosis (Sirius red staining; Extended Data Fig. 3f). A few mice that 
continued to have increased levels of serum total bilirubin and liver 
fibrosis showed abundant wsCK-positive hepatocytes but no HpBDs 
(Extended Data Fig. 3a, b, g). In contrast to the transient hepatocyte- 
derived ductules observed in other mouse models of cholestatic liver 
injury**, HpBDs were stable beyond the time when cholestasis and liver 
injury were resolved and were maintained for life (>P334) (Fig. 1c, e). 
These results show that HpBDs provide normal and stable biliary 
function. 

We also investigated the authenticity and maturity of the cholangio- 
cytes that form the HpBDs. The cells displayed acetylated tubulin- 
marked primary cilia (Fig. 2a), indicating a switch from hepatocyte to 
biliary fate’®, and somatostatin receptor 2 (SSTR2) (Fig. 2b), a marker 
of biliary function?’. They also expressed the mature biliary markers 
epithelial cell adhesion molecule (EPCAM) and CK19 (Fig. 2c), 
which are lacking or expressed at low levels in hepatocyte-derived 
metaplastic biliary cells*>. To substantiate these results, we performed 
RNA-sequencing (RNA-seq) analysis on hepatocyte-derived peripheral 
cholangiocytes isolated as EPCAM-positive DBA-negative cells by 
FACS from >P115 Alb-cre*/~Rbpj//Hnfos mice (Extended Data 
Fig. 4a). Rbpj and Hnf6 were deleted in these cells (Extended Data 
Fig. 4b, c), which was expected because Alb-cre is active in embry- 
onic liver progenitors before they commit to hepatocyte or biliary 
fate®. Principal component analysis showed that hepatocyte-derived 
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left), 250 xm (c, P138 right), 100 j1m (b, c, P395 right, d, e, left), 251m 

(e, middle). 


peripheral cholangiocytes cluster closely with normal peripheral 
cholangiocytes isolated from Cre-negative littermates, but not 
with parental hepatocytes (Fig. 2d, e and Supplementary Table 1). 
Accordingly, hepatocyte-derived peripheral cholangiocytes expressed 
normal levels of mature biliary markers that are virtually undetectable 
in 3,5-diethoxycarbonyl-1,4-dihydrocollidine (DDC) diet-induced 
hepatocyte-derived metaplastic biliary cells*, except for CFTR, which 
is less enriched in cholangiocytes than the other markers (Fig. 2f). 
Hepatocyte-derived peripheral cholangiocytes also expressed other 
commonly used markers of biliary differentiation?*?°>?”8 at normal 
or near-normal levels (Fig. 2f), and showed virtually no memory of 
hepatocyte differentiation??? (Fig. 2g and Extended Data Fig. 4d). 
These results show that hepatocyte-derived peripheral cholangiocytes 
are authentic and mature peripheral cholangiocytes. 

Next, we investigated the contribution of proliferation to HpBD 
formation, and sought to determine whether a few hepatocytes 
proliferate extensively after transdifferentiation, or whether many 
hepatocytes transdifferentiate. For this, we measured clonal expansion 
in HpBDs by sparsely labelling hepatocytes in Alb-cre+/~RbpjHnfo!! 
R26ZG*'* mice with low-dose AAV8-Ttr-Flp at P18 and quantifying 
the cells in GFP-positive clones at P120 (Fig. 3a). Surprisingly, we 
found only 1.56 + 0.05 cells per clone in HpBDs, with one-cell clones 
accounting for 63.8 + 3.6% of clones. To exclude the possibility that 
the nonintegrating AAV vector missed proliferating hepatocytes, 
we analysed clone size in Alb-cre+'~Rbpj‘Hnf6"/R26R-Confetti*'— 
mice in which hepatocytes and hilar cholangiocytes are stochas- 
tically labelled with one of four fluorescent proteins (Fig. 3b and 
Supplementary Video 2). We found 1.91 +0.06 cells per clone and 
45.6 + 1.1% one-cell clones in HpBDs at P150. In addition, the number 
of cells per clone in hBDs was similar between these mice and 
Alb-cre*'” R26R-Confettit'/~ control mice at P90 (2.08 + 0.08 and 
2.41 40.01; P=0.060, two-sided Student’s t-test) (Extended Data 
Fig. 5a). These results show that HpBDs form with little proliferation 
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Fig. 2 | Hepatocyte-derived peripheral cholangiocytes are equivalent 
to normal mature peripheral cholangiocytes. a—c, Immunofluorescence 
of hepatocyte-fate-traced P120 Alb-cre*’~ RbpjHnfo"/R26ZG*'*+ mouse 
liver (n =3 each). acTUB, acetylated tubulin. Scale bars, 100m (c), 

20 um (a, b). d~g, RNA-seq analysis of normal peripheral cholangiocytes 
(pC; n =3 mice), hepatocyte-derived peripheral cholangiocytes (HpC; 
n=4 mice) and RBPJ- and HNF6-deficient hepatocytes (H; n=3 

mice). d, Principal component analysis. e, Venn diagram showing the 
number of genes significantly differentially up- and downregulated in 

pC or HpC versus H. f, Heat maps of genes reflecting cholangiocyte 
differentiation, including genes lacking in DDC diet-induced hepatocyte- 
derived metaplastic biliary cells (top) and other marker genes (bottom). 
g, Heat maps of genes reflecting hepatocyte differentiation, including all 
differentially expressed cytochrome P450 (CYP) genes enriched in adult 
mouse liver (top) and other marker genes (bottom). One-way ANOVA, 
false-discovery rate (FDR)-corrected P < 0.05; fold change > 3 (e-g). 
Two-sided Student's t-test; bold genes P < 0.05 for HpC versus pC (f, g). 


and are thus polyclonal, and confirm that hilar cholangiocytes do not 
contribute to HpBD formation. 

Unlike in HpBDs, we found significant proliferation in hepato- 
cyte-derived reactive ductules, which are detectable in Alb-cre*/~Rbpj!! 
Hnf6"'R26ZG*!* mice during cholestasis (Extended Data Fig. 5b). 
Clonal analysis showed 2.70 + 0.08 cells per clone in reactive ductules 
in P90 Alb-cre*!—RbpjHnf6"R26R-Confetti*/~ mice, with more three- 
and four-cell clones than in pBDs in Alb-cre*/" R26R-Confettit'~ mice 
(Extended Data Fig. 5c). Because HpBDs form with little proliferation, 
we reasoned that these proliferating cells are hepatocyte-derived 
metaplastic biliary cells*-®. Indeed, we found that 93.51 + 0.81% of the 
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proliferating biliary cells identified by KI67 and osteopontin (OPN) 
immunofluorescence in Alb-cre*/~ Rbpj!Hnf6" mice near the peak of 
cholestasis (P54) were CK19 negative (Extended Data Fig. 5d). 

We also investigated proliferation in established HpBDs by inducin 
cholestatic liver injury with a DDC diet in >P120 Alb-cret’- Rbpj!” 
Hnfo mice. Reactive ductules formed 2 weeks later than in 
Cre-negative littermates, and consisted mainly of OPN-positive 
EPCAM-negative cells, indicative of hepatocyte metaplasia (Extended 
Data Fig. 5e). Indeed, after feeding a DDC diet to Alb-cret/~ Rbpj!! 
Hnf6"'R26ZG*'* mice in which hepatocyte fate tracing was induced 
at >P120, we found that 84.5 + 5.6% of the cells in reactive ductules 
originated from hepatocytes, in contrast to 20.1 + 3.0% in Cre-negative 
littermates (Extended Data Fig. 5f). These results confirm that NOTCH 
signalling is important for cholangiocyte proliferation*!, which explains 
our finding of a limited role of proliferation in HpBD formation 
and underscores the authenticity of hepatocyte-derived peripheral 
cholangiocytes. 

As illustrated by a lack of pBDs in Alb-cret'~ Rbpj/Hnfo mice 
(Fig. 1c), NOTCH signalling is needed for bile duct development’, 
raising the question of which signalling pathway drives HpBD forma- 
tion in its absence (Extended Data Fig. 4b, c). We focused on TGF3 
signalling because it promotes biliary differentiation and morpho- 
genesis in development, although it is not essential, as shown by the 
development of a normal biliary tree in mice that lack the TGFG type 
II receptor (TGFBR2) in embryonic liver progenitors (Extended Data 
Fig. 6a). We reasoned that TGF@ signalling is induced in hepatocytes of 
Alb-cret'- Rbpj!Hnfo!! mice because of the paradoxical role of HNF6 in 
liver development—it activates the biliary transcription factors HNF18 
and SOX9*°3, but inhibits TGFBR2*?"+. Indeed, Gene Ontology (GO) 
term enrichment analysis of our RNA-seq data suggested active TGF3 
signalling in hepatocyte-derived peripheral cholangiocytes, but not 
in normal peripheral cholangiocytes (Supplementary Table 1, down 
pC versus HpC). Moreover, we found high levels of phosphorylated 
SMAD3 (pSMAD3) in the nuclei of periportal HNF1-positive epithelial 
liver cells and in whole-liver nuclear extracts in Alb-cre*!~ Rbpj!"Hnfo"! 
mice at P60 (Extended Data Fig. 6b, c). 

We functionally validated these findings by showing that the TGFS 
inhibitor SB-431542 blocks biliary differentiation and morphogene- 
sis of hepatocytes that lack RBPJ and HNF6 in vitro (Extended Data 
Fig. 6d-f). Moreover, at >P120, HpBDs were still absent or severel 
truncated, and liver fibrosis persisted in 14 out of 16 Alb-cret! ~Rbp} if 
Hnfo!Tgfbr2!! mice that, like Alb-cre*'~ Rbpj/Hnfo mice, were 
cholestatic at P34—P53 (Fig. 3c, d and Extended Data Fig. 6c, g). 

These findings led us to investigate whether activating TGF6 
signalling in hepatocytes enhances HpBD formation. We intravenously 
injected P19-P24 Alb-cret!~ Rbpj/Hnf6" mice with an AAV serotype 
8 vector expressing constitutively active TGFBR1 from the eukaryotic 
translation elongation factor 1al1 (Eefla1) promoter (AAV8-Eeflal- 
caTgfbr1). At P80-P100, 9 out of 11 treated mice had a mature 
hierarchical biliary network, whereas 9 out of 9 untreated mice still 
had an immature homogeneous biliary network* (Fig. 3e). Reflecting 
improved bile drainage, cholestasis and liver fibrosis resolved faster in 
treated mice (Fig. 3f, g). We ruled out the possibility that AAV8-Eeflal- 
caTgfbrl is fibrogenic in Rbpj/“Hnfo" mice (Extended Data Fig. 6h). 
These results identify TGF8 signalling as the driver of transdifferenti- 
ation and morphogenesis in HpBD formation. 

To exclude the possibility that transdifferentiation is limited to 
immature hepatocytes, we investigated whether adult hepatocytes 
can form HpBDs. To bypass potential adaptive processes in devel- 
opment, we deleted Rbpj and Hnf6 and activated GFP in hepatocytes 
of P75 Rbpj“Hnf6"/R26ZG*'* mice by co-injecting AAV8-Ttr-Cre 
and AAV8-Ttr-Flp and transplanted the cells 1 week later into P31 
Alb-cre*'~ Rbpj!"Hnfo"!Rag2~/~Tl2rg-'~ mice (Fig. 4a). After P120, we 
found donor hepatocyte-derived mature peripheral cholangiocytes in 
5.2 + 0.8% of the portal areas containing donor cells (Fig. 4b). We also 
transplanted NOTCH signalling-competent wild-type hepatocytes, 
which produced donor-derived HpBDs in 78.3 + 13.9% of such portal 
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Fig. 3 | HpBD formation entails little proliferation and is driven by 
TGF® signalling. a, Possible outcomes, maximum projection image and 
size distribution of clones in ele -fate-traced P120 Alb-cret!- Rbpj 
Hnfo''R26ZG*'* mice (n= 3). b, Possible outcomes, image stack volume 
projection and size poem of clones in P150 Alb-cre*!- RbpjHnfo"t 
R26R-Confettit'~ (n= 4) and Alb-cre*!~ R26R-Confetti*'~ (n=3) mice. 

c, Ink visualization of biliary tree of >P120 Alb-cre*/~ Rbpj!Hnfo''Tgfbrai 
(n= 16), RbpjHnfo"!Tgfbr2"! (n= 4) and Alb-cret!~ Rbpj!"Hnfo"! (n= 1) 
mice. d, Sirius red staining with quantification in >P120 Alb-cret’- Rbpj'! 
HnfoI Tgfbr2 (n= 4), Rbpj"HnfoTgfbr2! (n = 2) and Alb-cret!- Rbpj! 


areas because 31.6 + 7.9% of the transplanted cells proliferated after 
transdifferentiation into cholangiocytes (Fig. 4a, c, d). These results 
show that adult hepatocytes, and transplanted hepatocytes, respond 
to the transdifferentiation-inducing and morphogenetic signals in the 
bile-duct-deficient liver and form HpBDs. 

To determine whether our findings are relevant for human ALGS, we 
obtained liver samples from two patients who developed regenerative 
nodules containing pBDs** (Extended Data Table 1). The regenera- 
tive nodules contained CK7-positive pBDs, whereas nonregenerated 


a Alb-cre*-Rbpj‘* 
pea ‘Hnf6R26ZG*"* Hnfe" ee ee Fe 


P75 P82 


t | —— 


AAV8-Ttr-Cre Hepatocyte isolation 
AAV8-Ttr-Flp and fa. 


GFP EPCAM Ic 


AAV8-Ttr-Cre 


ZsGreen EPCAM 


| R26R- ciel te 
Son Se te 


DAPI 


Regenerative nodule 


Regenerative nodule Nonregenerated liver tissue f 


CK7 


250 | NATURE | VOL 557 | 10 MAY 2018 


> 
‘ales 20 
he 

analysis 


Hnf6!’S (n= 2) mice. Dotted lines represent the mean values of the 
indicated P120 mice from Extended Data Fig. 3f. e, f, Ink visualization 

of biliary tree and Sirius red staining with quantification in P100 
Alb-cre*'~ Rbpj'Hnf6"! mice that did (n = 9) or did not (n =8) receive 
AAV8-Eeflal-caTgfbr1. *P = 0.045, two-sided Student's t-test. g, Serum 
total bilirubin in P36 Alb-cret’- RbpjHnf6" mice that did (n =6) or did 
not (n= 8) receive AAV8-Eeflal-caTgfbr1. *P = 0.047, two-sided Welch's 
t-test. Horizontal lines in a, b, d, f and g denote mean values. Scale bars, 
2mm (c), 500m (e), 100m (d, f), 50 um (a), 20,1m (b). 


liver tissue from the same patients showed abundant CK7-positive 
cells with hepatocyte morphology, indicative of metaplasia!*-'” 
(Fig. 4e). To determine whether TGF@ signalling is active in the 
new pBDs, we used pSMAD3 immunofluorescence. We found nuclear 
localization of pSMAD3 in 56.1 + 6.1% of the pBDs in regenerative 
nodules, but not in pBDs in normal human liver (Fig. 4f). These results 
suggest that the TGF3-mediated mechanism of HpBD formation iden- 
tified in Alb-cre+!~Rbpj/‘Hnf6"! mice is also active in some patients 
with ALGS. 


Fig. 4 | Clinical relevance and therapeutic 
potential of HpBD formation. 

a, Experimental design for hepatocyte 
transplantation. b, Immunofluorescence of 
P127 mouse (n= 5) transplanted with adult 
GFP-expressing RBPJ- and HNF6-deficient 
hepatocytes. c, Immunofluorescence 

of P152 mouse (n= 4) transplanted 

with adult RFP-expressing hepatocytes. 

d, Immunofluorescence of liver of 

P72 mouse (n = 2) transplanted at P43 

with hepatocytes isolated from P287 
Alb-cre*!~R26R-ZsGreent!* mouse. 

e, f, Immunohistochemistry (e) and 
immunofluorescence (f) of ALGS (n= 2) and 
normal (n= 1) human livers. Arrowheads 
indicate nuclear psMAD3 in pBDs. Scale 
bars, 100m (b, ¢, e, f), 50 jum (d). 
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In conclusion, by showing that hepatocytes can convert into mature 
cholangiocytes and form a functional and stable biliary system, our find- 
ings establish that hepatocyte plasticity extends beyond metaplasia to 
transdifferentiation, and provide the first example, to our knowledge, of 
mammalian transdifferentiation building an organ structure de novo. 
Although hilar cholangiocytes are present in our mouse model, they fail 
to expand, resulting in severe cholestatic liver injury and strong pres- 
sure for hepatocytes to transdifferentiate into peripheral cholangiocytes. 
Analogously, cholangiocytes transdifferentiate into hepatocytes only 
when hepatocyte proliferation is completely suppressed*”. The failure 
of hilar cholangiocytes to proliferate is probably caused by high TGF8 
signalling in the cholestatic liver**. Accordingly, we identified TGFB 
signalling as the driver of hepatocyte transdifferentiation and HpBD for- 
mation in our mouse model, and potentially also in patients with ALGS. 
Unlike for bile duct development’”-°, NOTCH signalling is not needed. 
Using clinically established AAV vectors and hepatocyte transplantation, 
we show that our findings are potentially translatable into therapies for 
ALGS and other diseases associated with a lack of bile ducts. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0075-5. 
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METHODS 

Mice. Alb-cre+/~ Rbpj/Hnfo" mice*® (mixed background), R26R-RFP*/+ mice? 
(C57BL/6) and R26NZG*/*+ mice*” (FVB) were previously reported. Flp-reporter 
mice were generated by crossing R26NZG*/* mice with Ella-cre*!* mice‘! 
(C57BL/6) to remove the Cre-reporter element and then crossing out the Ella-cre. 
These R26ZG*!* mice were crossed with Alb-cre*!~ Rbpj"Hnf6" mice to gener- 
ate Alb-cre+!~ Rbpj!/HnfO"/R26ZG*'* mice. R26R-Confettit/+ mice (C57BL/6) 
were crossed with Alb-cre*/~RbpjHnfo mice to generate Alb-cre*!- Rbpj/! 
Hnf6/R26R-Confetti*'~ mice. Tgfbr2/ mice** (C57BL/6) were crossed with 
Alb-cre+!~ Rbpj!Hnf6"! mice to generate Alb-cre+'— RbpjHnfo" Tgfor2! mice. 
Because Tgfbr2 (68.39 cM) and Hnf6 (41.93 cM) are both on chromosome 9, recom- 
binants were generated at 0.1356 (35 out of 258 mice) observed frequency (0.26 
expected frequency). Different founder recombinants were intercrossed to generate 
Alb-cre*!~ Rbpj"Hnfo"!Tgfbra! mice. R26R-ZsGreen*!* mice (C57BL/6) were 
crossed with Alb-cre*’~ mice to generate Alb-cre*!R26R-ZsGreen*'* mice. 
Alb-cre*!~ Rbpj!Hnf6"! mice were crossed with Rag2~/~Il2rg~/~ mice**6 (mixed 
background) to generate Alb-cre+/- Rbpj!/Hnf"Rag2~/~Il2rg~/~ mice. Male and 
female mice of the indicated age and genotype were chosen randomly for inclusion 
in experiments. The mouse used as a positive control for biliary gene expression 
was a C57BL/6 wild-type mouse fed choline-deficient diet (MP Biomedicals) 
and given 0.15% (w/v) ethionine (Sigma-Aldrich) in the drinking water (CDE 
diet) for 3 weeks. All mice were kept under barrier conditions. All procedures 
were approved by the Institutional Animal Care and Use Committee at UCSF or 
CCHMC, 

Adeno-associated virus. The AAV-Ttr-Flp plasmid was generated by removing 
Cre from AAV-Ttr-Cre‘’ and replacing it with Flpo** from pPGKFlpobpA 
(Addgene 13793); viruses were produced by Vector Biolabs and used at a high 
dose of 1 x 10!*-3 x 10!” vg or low dose of 4 x 101! vg. The AAV-Eeflal-caTgfbr1 
plasmid was built by VectorBuilder to contain the activating T204D° mutation; 
virus was produced by Vector Biolabs and used at a dose of 1 x 10"! vg. Titres were 
determined using qPCR. Viruses were delivered by tail vein injection in volumes 
<100 ul to prevent hydrodynamic effects. 

Tissue collection. Livers from Alb-cret’~ Rbpj!/Hnfo"/R26ZG*'*, R26R-Confetti!- 
and transplanted mice were perfused with ice-cold PBS followed by 4% paraform- 
aldehyde (PFA). Samples were cut into slices and fixed overnight in 4% PFA at 4°C. 
Livers from R26R-RFP*!*, Alb-cre+!— RbpjHnfo"!, Rbpj'Hnfo"! and Alb-cre+!— 
Rbpj!'Hnfo"'R26ZG*'* mice were fixed overnight in 4% PFA at 4°C. For thin 
sectioning, samples were moved to 30% sucrose overnight at 4°C to cryopreserve 
and then embedded in OCT (Tissue-Tek) or processed for paraffin embedding 
and sectioning. A Leica CM3050 S cryostat was used to cut 6}1m cryosections for 
staining and imaging. For 3D and sparse-labelling clonal analysis, liver samples 
were embedded in 4% agarose and sectioned on a Leica VT100 S vibratome. For 
clonal analysis in R26R-Confettit'~ mice, liver samples were cut into 1 mm slivers, 
immunostained, washed in PBS, equilibrated in 30% sucrose and embedded in 
OCT. The 1 mm slivers were then cryosectioned at 100 1m, stained with Hoechst 
and mounted in PBS for confocal imaging. For analysis of Flp-based hepatocyte fate 
tracing in R26ZG*!* mice, liver samples were fixed in neutral-buffered formalin 
containing zinc (Z-Fix, Anatech), embedded in paraffin and sectioned to 541m. 
Human liver tissue. Explant samples from the regenerative nodule and nonre- 
generated liver tissue of a 3-year-old male patient with ALGS were previously 
described; regenerative nodule and nonregenerated liver tissue contained the same 
heterozygous JAGI exon 1-26 deletion**. Samples were obtained with patient con- 
sent and approval from the Commission Cantonale d’Ethique de la Recherché 
CCER. Biopsy samples from the regenerative nodule and nonregenerated liver tis- 
sue of a 15-year-old male patient with ALGS caused by a heterozygous c.499T > A 
(p.W167R) JAG1 mutation and resection samples from a histologically normal 
region of the liver of a 35-year-old male undergoing surgery for metastasis of rectal 
adenocarcinoma were obtained with patient consent and approval from the UCSF 
Institutional Review Board. 

Immunostaining and histology. Cryosections were blocked in 10% normal serum 
and permeabilized in 0.3% Triton-X before staining with primary and secondary 
antibodies listed in Supplementary Tables 2 and 3. Paraffin-embedded samples 
were deparaffinized and underwent antigen retrieval in sodium citrate buffer 
(Bio-Genex or Vector Labs) or Tris EDTA buffer (10 mM Tris Base, 1 mM EDTA, 
0.05% Tween 20, pH 9.0) in a steamer or pressure cooker for 15 min before blocking 
and permeabilization. Thin sections were mounted in FluorSave (MilliporeSigma) 
or 50% glycerol. Vibratome sections were stained free-floating in 12-well dishes 
and cleared in Focus Clear (Cedarlane Laboratories) before being mounted in 
Mount Clear (Cedarlane Laboratories) for confocal imaging. All samples stained 
with anti-wsCK antibody underwent antigen retrieval in 100 mM Tris buffer 
(pH 9.5) at 55°C for 2h before blocking. Staining with anti-pSMAD3 antibody 
used a biotinylated secondary antibody followed by avidin/biotin-based peroxi- 
dase and tyramide amplification. Staining with anti-SSTR2 antibody used an HRP 
secondary antibody followed by tyramide-based amplification. M.O.M. Kit (Vector 


Laboratories) was used for antibodies raised in mice. DAPI or Hoechst was used to 
stain nuclear DNA. Chromogenic detection used a horseradish peroxidase (HRP) 
secondary antibody followed by avidin/biotin-based peroxidase amplification and 
DAB substrate exposure. Sirius red staining was performed on deparaffinized and 
rehydrated samples using a 0.1% Picrosirius red solution (Dudley Corporation and 
Newcomer Supply) and 0.5% acetic acid water washes or was done at Peninsula 
Histopathology Laboratory. 

Imaging. Thin sections were imaged on an Olympus BX51 upright microscope 
and cultured cells were imaged on an Olympus IX71 inverted microscope using 
Openlab software (PerkinElmer). Chromogenic stains were imaged on a Leica 
DM 1000 LED. Fiji*® and/or Photoshop (Adobe) were used to process (brightness, 
contrast and gamma) and merge channels. Thick sections for 3D analysis of 
connectivity and clonal analysis were imaged on a Leica upright AOBS confocal 
microscope and processed and analysed using Imaris (Bitplane) or Volocity 
(PerkinElmer) software. For R26R-Confettit/~ mouse and pSMAD3 analysis, 
images were acquired using a Nikon A1R GaAspP inverted SP confocal microscope 
and NIS elements software and processed and analysed using Imaris software. 
Sirius red-stained sections were imaged using a Cytation 5 cell imaging multi-mode 
reader (BioTek). 

Ink injection. A catheter was inserted in a retrograde fashion into the common 
bile duct of post-mortem mice and waterproof ink (Higgins) was slowly injected. 
Left liver lobes were dehydrated in 1:1 methanol:water followed by 100% methanol. 
Ink was visualized by tissue clearing in 1:2 benzyl alcohol:benzyl benzoate (BABB) 
solution and imaged on a Leica M205A or Nikon SMZ800 stereoscope. 
Cholangiocyte isolation. Nonparenchymal liver cells were isolated from >P115 
Alb-cre*!~ RbpjHnfo and Rbpj"Hnf6! mice as previously described*!. Cells were 
resuspended at 1 x 10” cells ml~' in DMEM/2% EBS and blocked with Mouse 
Fc Block (BD Biosciences) for 30 min. Cells were incubated with fluorochrome- 
conjugated antibodies (Supplementary Table 2) and DBA-FITC (Vector 
Laboratories) for 30 min, washed with cold Dulbecco's PBS (DPBS) three times 
and resuspended in DMEM/2% FBS. Sytox Red (Thermo Fisher Scientific) was 
added to label dead cells before sorting. Unstained and single-stained cells were 
used for compensation. Specificity of DBA binding was verified with a GalNAc 
(Sigma)-blocked control as previously described”. Cells were analysed and sorted 
on a FACSAria III using FACSDiva software (BD Biosciences). From the CD11b™ 
CD31-CD45~ population, EPCAM*DBA cells were collected as peripheral 
cholangiocytes and EPCAM*DBA* as hilar cholangiocytes. FlowJo (FlowJo, 
LLC) was used to analyse data and generate charts. Cells were either sorted into 
DMEM/2% FBS, pelleted and snap frozen, or sorted directly into extraction buffer 
for RNA purification. 

Hepatocyte isolation. Hepatocytes were isolated from >P115 Alb-cre+!—Rbpj/! 
Hnf6! mice by two-step collagenase (Worthington) perfusion followed by puri- 
fication through a Percoll gradient. Cells were resuspended at 1 x 10° cells per 
100 jl in Hanks buffer with 10% FBS and incubated with OC2-2F8 antibody 
(Supplementary Table 2) for 1h on ice. Cells were washed with cold DPBS twice 
and resuspended in Hanks/10% FBS. Fluorochrome-conjugated secondary anti- 
body (Supplementary Table 3) was added and cells were incubated for 30 min on 
ice followed by two washes with cold DPBS. Cells were blocked with 5% normal 
rat serum (Jackson Immuno) in Hanks buffer for 10 min on ice. Cells were then 
incubated with fluorochrome-conjugated antibodies (Supplementary Table 2) for 
30 min on ice. Cells were then washed in cold DPBS three times and resuspended 
in Williams E medium/2% FBS. Sytox Red (Thermo Fisher Scientific) was added 
to label dead cells before sorting. Unstained and single-stained cells were used for 
compensation. Cells were analysed and sorted on a FACSAria III using FACSDiva 
software (BD Biosciences). From the CD11b-CD31-CD45-EPCAM” population, 
OC2-2F8* cells were collected as hepatocytes (Supplementary Fig. 2). Cells were 
either sorted into DMEM/2% FBS, pelleted and snap frozen, or sorted directly into 
extraction buffer for RNA purification. 

Hepatocyte transplantation. Hepatocytes were isolated from donor mice by two- 
step collagenase perfusion followed by purification through a Percoll gradient. 
1 x 10° viable cells were resuspended in 80-100 ul of Williams E medium with 
glutamine. Transplantation was performed by transdermal intrasplenic injection 
of the cell suspension under isoflurane anaesthesia. 

DDC diet feeding. Mice received PicoLab Mouse Diet 20, 5058 (LabDiet) with 
0.1% 3,5-diethoxycarbonyl-1,4-dihydrocollidine (DDC; Sigma-Aldrich) for the 
indicated durations. 

qPCR and gene expression analysis. Genomic DNA was isolated from cells 
using QlAamp DNA Micro Kit or DNeasy Blood & Tissue Kit (Qiagen). RNA 
was extracted using Trizol Reagent (Thermo Fisher Scientific) and purified by 
isopropanol precipitation (cells in collagen gel) or purified using RNAeasy Mini 
Kit (Qiagen) (liver tissue). Reverse transcription was performed using qScript 
cDNA Supermix (Quanta Biosciences). qPCR was performed using SYBR green 
reagent (Thermo Fisher Scientific) in a ViiA 7 system (Thermo Fisher Scientific). 
Reactions were performed in triplicate, and expression was normalized to an Rbpj 
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(genotyping) or Gapdh (gene expression) reference and quantified using the AAC, 
method. Primers are listed in Supplementary Table 4. 

RNA-seq. RNA was purified from FACS-isolated cells using PicoPure Kit (Thermo 
Fisher Scientific). RNA quality was assessed using RNA 6000 Pico Kit on a 2100 
Bioanalyzer (Agilent). Samples with an RNA integrity number (RIN) > 7.7 and 
at least 15 ng of RNA were used to construct sequencing libraries using Clontech 
Low Input Library Prep Kit v2. Libraries were sequenced on a HiSeq 3000, 10 
samples per lane, with single-end 50 bp reads. Raw reads were aligned to the 
mm 10 mouse genome with annotations provided by UCSC using CobWEB, a 
proprietary Burrows-Wheeler Transform method. Reads per kilobase per million 
(RPKM) were calculated from aligned reads using the expectation-maximiza- 
tion algorithm. RPKM was thresholded at 1, logy transformed, normalized using 
the DESeq algorithm and baselined to the median of all samples. Analyses were 
performed on transcripts with RPKM > 5 in all samples of at least 1 experimental 
condition (m= 17,793 transcripts). These reasonably expressed transcripts were 
used in principle component analysis. All transcripts with fold change > 3 in at 
least 1 of the 3 possible pairwise comparisons (n= 6,464) were selected, and a 
one-way ANOVA was performed to identify significantly differential genes with 
FDR-corrected P< 0.05 (n= 4,997). Venn diagrams were used to identify unique 
and shared gene signatures. Gene sets were submitted to https://ToppGene.cchmce. 
org for identification of pathway and biological process enrichments. 

Western blot. Nuclear and cytoplasmic extracts were generated from whole liver 
using previously described buffers with protease and phosphatase inhibitors”. 
Samples were run on SDS-PAGE 420% Tris-glycine gradient gels, electrophoret- 
ically transferred to nitrocellulose membrane and probed with antibodies. Signals 
were detected by ECL western blotting substrate (GE Healthcare). The membrane 
was stripped and reprobed with anti-actin antibody to verify and normalize protein 
loading using densitometry. Quantification was performed using Image] software. 
Serum chemistry. Blood was collected from post-mortem Alb-cret!— Rbpj!! 
Hnfo! mice and controls of ages P20 to >P150 and tested for serum total bilirubin 
(TecoDiagnostics) and alkaline phosphatase, alanine aminotransferase and aspartate 
aminotransferase (ADVIA XPT clinical chemistry system, Siemens). For serial 
bilirubin measurements, blood was collected by retro-orbital venipuncture and 
tested for total bilirubin every 2 weeks from P60 to P189 in four different litters. 
Blood was collected from Alb-cre+!~ Rbpj/Hnfo"Tgfbr2" mice and controls by 
retro-orbital venipuncture and tested for serum total bilirubin (TecoDiagnostics). 
Blood was collected from Alb-cre*!~ RbpjHnf6! mice intravenously injected with 
AAV8-Eeflal-caTgfbr1 and controls by retro-orbital venipuncture and serum 
total bilirubin was measured as previously reported™ using a Synergy 2 microplate 
reader (BioTek). Serum absorbance at 540 nm was subtracted from serum absorb- 
ance at 450 nm. A linear trendline equation of serum absorbance versus serum 
total bilirubin was determined by independently measuring absorbance and 
total bilirubin (TecoDiagnostics) of known cholestatic and noncholestatic serum 
samples. This equation was used to convert absorbance readings to serum total 
bilirubin levels. 

In vitro conversion assay. In vitro 3D culture of hepatocytes to induce biliary con- 
version was carried out as previously reported”! with the following modifications. 
Hepatocytes were isolated from 8-10-week-old RbpjHnf6"! mice injected with 
1 x 10! vg of AAV8-Ttr-Cre 2 weeks before to delete Rbpj and Hnf6. Hepatocytes 
were isolated by two-step collagenase perfusion followed by purification through 
a Percoll gradient, all in the absence of serum. Cells were grown on Primaria plates 
for 6 days to form spheroids. Spheroids were cultured in collagen gels (Cultrex 
Rat Collagen I, Lower Viscosity, Trevigen) in the presence of 101M SB-431542 
(Selleckchem) or vehicle (DMSO). The medium was changed every other day. 
Quantification and statistics. For sparse-labelling clonal analysis, eight liver 
regions from each of three Alb-cre*!~ Rbpj‘Hnfo"'R26ZG"'* mice were analysed. 
GFP-positive cells in all clones within 100 j1m-thick sections were manually 
counted using Imaris software for visualization. GFP-positive cells in direct contact 
were considered clones. Clones that extended to and potentially beyond the x, y or z 
boundaries were excluded. For clonal analysis in R26R-Confetti*! ~ mice, 30-40 jm 
z-stack images of wsCK-positive DBA-negative pBDs and wsCK-positive DBA- 
positive hBDs were visualized in 3D with Imaris software. The module ‘surfaces’ 
within Imaris was used to render a 3D surface created on an intensity value on a per 
channel basis. The rendered surface per channel was masked to the Hoechst nuclear 
stain. 3D pBD masks were used to manually count cells per clone. Multiple portal 
regions were analysed for clones at P90 (14 and 17 per Alb-cre*!~ RbpjHnfo'! 
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R26R-Confetti*'~ and 8 and 9 per Alb-cre+/“ R26R-Confettit/~ mouse) and P150 
(7-10 per Alb-cre!~ Rbpj!Hnf6//R26R-Confettit'~ and 8-10 per Alb-cre+!~ R26R- 
Confettit'~ mouse). To determine labelling efficiencies in Alb-cre+!" Rbpj/Hnfo!? 
R26ZG*!* mice, for each mouse, 20 random fields were analysed for wsCK-positive 
DBA-negative peripheral cholangiocytes (~350 cells) and 5 random fields were 
analysed for hepatocytes (~1,500 cells). For proliferation analysis, OPN-positive 
cells in four random portal fields from each of four mice were analysed for K167 
and CK19 staining. For hepatocyte-fate-tracing analysis in DDC diet-fed mice, 
ten random portal fields from four Alb-cre+/~ Rbpj/Hnfo/R26ZG*'* and three 
Rbpj'Hnfo"/R26ZG*'* mice were analysed. For quantification in transplantation 
experiments, all portal areas with donor-derived cells in a section from at least 
three lobes per mouse were examined and scored for the presence or absence of 
donor hepatocyte-derived EPCAM-positive cholangiocytes. For human samples, 
63 bile ducts between the 2 patient samples and 45 bile ducts in the control sample 
were scored for the presence of nuclear psMAD3. For quantification of Sirius 
red staining, liver samples were stained in batches and sections of whole lobes 
were imaged at equal exposure. Using Fiji software, a threshold was set for Sirius 
red-positive area within each lobe and the percentage of the total area that was 
Sirius red positive was measured and is reported. Charts were generated in Prism 
6 or 7 (GraphPad). Researchers were not blinded when analysing results. P< 0.05 
was considered statistically significant. Experiments were replicated independently 
once (Fig. 2a, b, d-g, 3a, b, d, g, Extended Data Figs. 1c—g, 2c, e, 3a-g, 4b-d, 5a, c, d, 
f, 6c, g; Supplementary Table 1), at least twice (Fig. 1d, e, 2c, 3f, 4b-f, Extended Data 
Figs. 5b, e, 6a, b, e, f, h; Supplementary Video 1) or at least three times (Fig. 1b, c, 
3c, e, Extended Data Figs. 2b, 4a; Supplementary Video 2). No statistical methods 
were used to predetermine sample size. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. The Gene Expression Omnibus (GEO) accession number for the 
RNA-seq data is GSE108315. Additional data are available from the corresponding 
authors on reasonable request. 
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Extended Data Fig. 1 | Flp-based hepatocyte fate tracing. a, R26ZG 
allele. b, Experimental design for establishing efficient, specific and 
constitutive labelling of hepatocytes in normal adult R26ZG*'* mice. 
c-f, Immunofluorescence of R26ZG*/* mouse liver (n = 2) for GFP and 
the hepatocyte marker major urinary protein (MUP) (c), peripheral and 
hilar cholangiocyte marker CK19 (d, e), hilar cholangiocyte marker DBA 
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AAV8-Ttr-Flp Liver analysis 


2 weeks 
(e), hepatic stellate cell marker desmin (DES), macrophage marker F4/80 
and endothelial cell marker LYVE1 (f) 2 weeks after intravenous injection 
of 1 x 10!* viral genomes (vg) of AAV8-Ttr-Flp. g, Reporter activation in 
R26ZG*'* mice 1 and 2 weeks after intravenous injection of the indicated 
dose of AAV8-Ttr-Flp (n= 1 for each dose and time point). Scale bars, 
100 1m. 
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Extended Data Fig. 2 | Efficiency of hepatocyte fate tracing in mice Alb-cre+!~ RbpjHnf6R26ZG*!* mice (n =5; top and bottom). Horizontal 
born with or without pBDs. a, b, Experimental design for hepatocyte line denotes the mean value. d, e, Experimental design for hepatocyte fate 
fate tracing at P17 and immunofluorescence at P120 in Rbpj/Hn fol! tracing at P39 and immunofluorescence at P120 in Alb-cre*/~ Rbpj/Hnfo 


R26ZG*'* mice (n=4). ¢, Correlation of GFP labelling efficiency between R26ZG*!*+ mice (n=3). Scale bars, 100,1m. 
hepatocytes and peripheral cholangiocytes in hepatocyte-fate-traced P120 
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Extended Data Fig. 3 | HpBDs relieve cholestasis and liver injury. 

a, Serum total bilirubin levels in P20-—P29 (n = 6), P30-P39 (n= 33), 
P40-P49 (n= 35), P50-P59 (n=8), P60-P69 (n = 46), P70-P79 (n= 22), 
P80-P89 (n= 13), P90-P119 (n= 20), P120-P149 (n=52) and >P150 
(n= 40) Alb-cre+!~ RbpjHnfo"! and P20-P29 (n= 5), P30-P39 (n= 24), 
P40-P49 (n= 19), P50-P59 (n=7), P60-P69 (n = 27), P70-P79 (n= 10), 
P80-P89 (n= 11), P90-P119 (n= 10), P120-P149 (n=41) and >P150 
(n= 25) Rbpj”Hnf6" mice. **P =0.0011 at P20-P29, ****P= 1.7 x 10-3 
at P30-P39, ****P= 8.2 x 107! at P40-P49, ***P = 0.00019 at P50- 
P59, ****P— 8.4 x 101° at P60-P69, ***P = 0.00055 at P70-P79, not 
significant (ns; P= 0.090) at P80-P89, not significant (P= 0.050) at 
P90-P119, not significant (P= 0.052) at P120-P149 and not significant 
(P= 0.064) at >P150, two-sided Welch’s t-test. b, Serial measurements 

of serum total bilirubin levels in Alb-cre+/~ Rbpj/Hnf6"! (n = 14) and 
Rbpj/Hnfo"t (n= 5) mice. P=0.11, two-sided Welch's t-test. ce, Serum 
alkaline phosphatase (ALP), alanine aminotransferase (ALT) and aspartate 
aminotransferase (AST) levels in P43-P45 (n= 13), P69-P82 (n= 13), 
P120 (n= 14) and P150 (n= 11) Alb-cre+!~ RbpjHnfo"! and P43-P45 
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(n=5), P69-P82 (n=4), P120 (n=9) and P150 (n=6) Rbpj! 

Hnfo" mice. In c: ****P = 0.000058 at P43-P45, ****P = 0.000019 at 
P69-P82, **P=0.0064 at P120, not significant (P= 0.22) at P150 and 
*** P — (0.00010 at P120 versus P69-P82, two-way ANOVA followed by 
Holm-Sidak multiple comparison test. In d: **P = 0.0050 at P43-P45, not 
significant (P= 0.47) at P69-P82, not significant (P=0.30) at P120 and 
not significant (P = 0.47) at P150, two-way ANOVA followed by Holm- 
Sidak multiple comparison test. In e: **P = 0.0024 at P43-P45, *P=0.015 
at P69-P82, ****P = 0.000073 at P120 and not significant (P= 0.061) at 
P150, two-way ANOVA followed by Holm-Sidak multiple comparison 
test. f, Sirius red staining in P15 (n =9), P70-P90 (n=7) and P120 (n=6) 
Alb-cre*!~ RbpjHnfo and P15 (n=5), P70-P90 (n= 3) and P120 (n=3) 
Rbpj!Hnf6" mice with quantification. Not significant (P = 0.94) at P15, 
PD — (000095 at P70-P90, **P =0.0074 at P120 and *P= 0.027 at 
P120 versus P70-90, two-way ANOVA followed by Holm-Sidak multiple 
comparison test. g, Immunohistochemistry and Sirius red staining in P313 
Alb-cre*'- RbpjHnf6" mice with persistent or resolved cholestasis (n = 1 
each). Horizontal lines in a-f denote mean values. Scale bars, 100 um. 
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Extended Data Fig. 4 | Isolation and gene expression profiling of 
hepatocyte-derived peripheral cholangiocytes. a, FACS gates for 
peripheral cholangiocyte (pC; EPCAM*DBA_) and hilar cholangiocyte 
(hC; EPCAM*DBA*) isolation from Alb-cret/~ Rbpj“Hnfo"! and Rbpj! 
Hnfo! mice. b, qPCR analysis of floxed Rbpj (Rbpj!") genomic DNA in 
hepatocyte-derived pC (HpC) and hC isolated from Alb-cret!— Rbpj!! 
Hnfo" mice relative to hepatocytes isolated from Rbpj/Hnfo” mice 
(dashed line; n = 3 each). Data were normalized to a downstream genomic 
region of Rbpj to control for gene copy number. Data are mean £s.e.m. 

c, d, RNA-seq analysis of normal pC (n =3 mice), HpC (n =4 mice) and 
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RBPJ- and HNF6-deficient hepatocytes (H; n =3 mice). c, Heat map of 
genes reflecting deletion of Rbpj and Hnf6 (also known as Onecut1). Rbpj 
mRNA is present in this knockout mouse as a truncated transcript that 
does not produce a functional protein”®. d, Heat map of all differentially 
expressed CYP genes distinguishing genes associated with mature (M), 
adolescent (A) and immature (I) hepatocyte differentiation or low 
expression in the liver (L)*®. P< 0.05, one-way ANOVA, FDR-corrected; 
fold change > 3 (c, d, except Rbpj and Notch1-Notch4). Bold genes denote 
P<0.05 for HpC versus pC, two-sided Student’s t-test (c, d). 
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Extended Data Fig. 5 | Proliferation in HpBDs and reactive ductules. 

a, Size distribution of wsCK-positive DBA-positive hilar cholangiocyte 
clones in P90 Alb-cret!~ Rbpj!Hnfo"/R26R-Confetti*!~ (n=2) and 
Alb-cre*!~ R26R-Confettit'~ (n= 2) mouse livers. **P =0.0079 for three 
cells and ***P = 0.00092 for seven cells, two-sided Student's t-test. 

b, Immunofluorescence of reactive ductules in hepatocyte-fate-traced P32 
Alb-cre*'- RbpjHnfo"'R26ZG*'* mouse liver (n= 3). ¢, Size distribution 
of wsCK-positive DBA-negative peripheral cholangiocyte clones in P90 
Alb-cre*!~ Rbpj!"HnfOR26R-Confettit!~ (n= 2) and Alb-cre*!— 
R26R-Confetti*'— (n= 2) mouse livers. *P = 0.032 for one cell, *P = 0.024 
for two cells, *P= 0.020 for three cells and *P=0.014 for four cells, 
two-sided Student’s t-test. Horizontal lines in a and c denote mean values. 


m= Cholangiocyte derived 


d, Immunofluorescence and breakdown of OPN-positive K167-positive 
cells based on CK19 expression in P54 Alb-cret’~ RbpjHnf6/) mouse liver 
(n=4). Arrowheads indicate OPN-positive KI67-positive CK19-negative 
cells. e, Immunofluorescence of liver of >P120 Alb-cre+!~ Rbpj/“Hnfo"! and 
RbpjHnfo"! mice after DDC diet feeding for 2 (n=1 each), 4 (n=3 each) 
and 6 (n= 1 each) weeks. f, Immunofluorescence of liver and breakdown 
of OPN-positive cells based on hepatocyte fate tracing in >P120 

Alb-cre*!~ RbpjHnfo"IR26ZG*!* (n= 4) and Rbpj//Hnfo//R26ZGt'+ 
(n=3) mice fed DDC diet for 5 weeks starting 1 week after hepatocyte fate 
tracing was induced. Data in d and f are mean + s.e.m. Scale bars, 100,1m 
(b, e, f), 501m (d). 
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Extended Data Fig. 6 | TGF® signalling in hepatocyte 
transdifferentiation. a, Ink visualization of biliary tree of P32 

Alb-cre*'~ Tgfbr2! mouse (n= 2). b, Immunofluorescence of P60 
Alb-cre*'~ RbpjHn fo! and Rbpj!‘Hnfo"! mouse livers (n =2 each). 
Arrowheads indicate psMAD3-positive HNF1-positive nuclei. c, Western 
blot with quantification of pSMAD3 in nuclear extracts from Alb- 

cre*!— RbpjHnfo', RbpjHnfo"! and Alb-cre*!~ Rbpj/HnfO'Tg fora” 
mouse livers (n= 2 each). Source data are shown in Supplementary Fig. 1. 
d-f, Experimental design (d) and results of analysis of the effect of TGF3 
signalling on biliary differentiation of adult RBPJ- and HNF6-deficient 
hepatocytes cultured in 3D for the indicated number of days (d). 

e, Phase-contrast images of RPBJ- and HNF6-deficient hepatocyte 
spheroids embedded in collagen gels and cultured in the presence or 
absence of the TGF inhibitor SB-431542 (SB) for the indicated number of 
days. f, Relative expression levels of cholangiocyte and hepatocyte genes in 


freshly isolated hepatocytes and spheroids before and after embedding in 
collagen gels. Gene expression in the liver of a mouse fed choline-deficient 
ethionine-supplemented (CDE) diet was used as a positive control. Data 
are from three independent cultures per treatment in a representative 
experiment (n= 2). *P =0.038 for Sox9 5 days, *P = 0.044 for Sox9 

10 days, **P =0.0034 for Krt19 5 days and **P= 0.0071 for Spp1 5 days, 
two-sided Welch's t-test. g, Serum total bilirubin levels in P34-P53 
Alb-cre*'~ Rbpj"Hnfo"! Tgfbr2 (n = 16) and RbpjHnfo"/Tgfbra! (n=7) 
mice. ****P = 0,000024, two-sided Welch’s t-test. h, Quantification of 
Sirius red staining in P58-P 100 Rbpj”“Hnfo"! mice after intravenous 
injection of AAV8-Eeflal-caTgfbr1 at P20 (n =4). Grey area represents 
the range of liver collagen in the indicated Rbpj/Hnf6" mice from 
Extended Data Fig. 3f. Horizontal lines in c, g and h denote mean values. 
Data in f are mean +s.e.m. Scale bars, 2mm (a), 100 1m (e), 501m (b) and 
10m (b, inset). 
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Extended Data Table 1 | Characterization of human subjects and samples 


Human |Gender |ALGS genotype ALGS phenotype Age at liver | Liver samples 

subject analysis 

ALGS_ |Male Heterozygous JAG1__| Neonatal cholestasis, bile 3 years Regenerative nodule and 

patient exon 1-26 deletion duct paucity, liver fibrosis, nonregenerated tissue 
portal hypertension, (segment 5 of right lobe and 
hepatocellular carcinoma, segment 8 of right lobe of 
liver transplantation; left explanted liver, respectively) 


pulmonary artery stenosis; 
butterfly vertebrae; facial 


features 
ALGS_ |Male Heterozygous JAG1 ___| Failure to thrive; 15 years Regenerative nodule and 
patient exon 4 mutation hepatomegaly, bile duct nonregenerated tissue 
(c.499T>A; p.W167R) | paucity, cholestasis, liver (biopsies of segment 3 of left 
fibrosis; left pulmonary artery liver lobe and segment 5 of 
stenosis; facial features right liver lobe, respectively) 
Normal | Male Normal JAG7 and None 35 years Histologically normal tissue 
control NOTCH2 sequences (resection of segment 4 of left 


liver lobe for metastasis of 
rectal adenocarcinoma) 
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Male-killing toxin in a bacterial symbiont of 


Drosophila 


Toshiyuki Harumoto! & Bruno Lemaitre!* 


Several lineages of symbiotic bacteria in insects selfishly manipulate 
host reproduction to spread in a population’, often by distorting 
host sex ratios. Spiroplasma poulsonii~” is a helical and motile, 
Gram-positive symbiotic bacterium that resides in a wide range of 
Drosophila species‘. A notable feature of S. poulsonii is male killing, 
whereby the sons of infected female hosts are selectively killed 
during development’”. Although male killing caused by S. poulsonii 
has been studied since the 1950s, its underlying mechanism is 
unknown. Here we identify an S. poulsonii protein, designated 
Spaid, whose expression induces male killing. Overexpression of 
Spaid in D. melanogaster kills males but not females, and induces 
massive apoptosis and neural defects, recapitulating the pathology 
observed in S. poulsonii-infected male embryos*''. Our data suggest 
that Spaid targets the dosage compensation machinery on the male 
X chromosome to mediate its effects. Spaid contains ankyrin repeats 
and a deubiquitinase domain, which are required for its subcellular 
localization and activity. Moreover, we found a laboratory mutant 
strain of S. poulsonii with reduced male-killing ability and a large 
deletion in the spaid locus. Our study has uncovered a bacterial 
protein that affects host cellular machinery in a sex-specific way, 
which is likely to be the long-searched-for factor responsible for 
S. poulsonii-induced male killing. 

Endosymbiotic bacteria have evolved sophisticated strategies to 
manipulate their hosts to increase their transmission, and sex ratio dis- 
torters of arthropods are examples of this manipulation. These bacteria 
are transmitted exclusively through female hosts, and asa result, several 
lineages have evolved the ability to bias infections towards females, 
either by turning males into females (feminization), causing clonal 
reproduction (parthenogenesis), or eliminating males (male killing)’. 
Male killing has independently evolved in at least six bacterial taxa, 
including Spiroplasma, Wolbachia, and Rickettsia’. The genetic basis of 
male killing is a longstanding mystery in the field of insect symbiosis. 
Male killing by S. poulsonii (Fig. 1a) was first described in the 1950s 
in Drosophila’. Previous studies of S. poulsonii attributed the selective 
killing of male progeny to an unknown substance called ‘androcidin, 
assumed to be secreted by the bacterium!”. The identification of this 
toxin has been hampered by the lack of practical methods for molecular 
biology, as has been the case for other symbiotic bacteria. 

S. poulsonii symbionts of D. melanogaster (strain MSRO for 
‘melanogaster sex ratio organism) kill all male progeny (for exam- 
ple, MSRO-H99; Extended Data Fig. 1a). We unexpectedly identified 
a S. poulsonii mutant strain that shows reduced male-killing ability 
(MSRO-SE; the partial male-killing strain), where almost half of 
the male progeny survived (Extended Data Fig. la—c). The reduced 
male killing was not linked to host genetic background or low 
bacterial titre (Extended Data Fig. 1b, d). To identify the genetic basis 
of reduced male killing, we sequenced the genome of MSRO-SE and 
compared it with that of MSRO-H99 (Extended Data Fig. 2). We 
found a candidate gene that was altered in the partial male-killing 
strain, encoding a 1,065-amino-acid protein with ankyrin repeats 
and the OTU (ovarian tumour) deubiquitinase domain. We named 
this S. poulsonii protein Spaid (S. poulsonii androcidin; Fig. 1b) based 


on our subsequent functional studies. We focused our analysis on 
this gene, because (i) both ankyrin repeats and the OTU domain are 
conserved in proteins across eukaryotes and are also present in some 
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Fig. 1 | Expression of Spaid selectively eliminates male offspring. 

a, Male killing in Drosophila induced by S. poulsonii. Infected females 
(top) produce only female offspring (bottom). Inset shows a male-killing 
S. poulsonii of D. melanogaster detected by DNA staining. b, Protein 
structure of Spaid, which contains ankyrin repeats (ANK, red), the OTU 
deubiquitinase domain (blue), an N-terminal signal peptide (SP, black), 
and a C-terminal hydrophobic region (HR, green). Spaid AC of the partial 
male-killing strain encodes a protein with an amino acid substitution 
(Q787C) and a C-terminal truncation. The structures of the two deletion 
constructs of Spaid (AANK and AOTU) are also indicated. The numbers 
represent amino acid residues. c, The number of adult progeny obtained 
from crosses between the actin-GAL4 line and four UAS transgenic lines 
(GEP, Spaid, AANK, and AOTU; n= 10 independent crosses for each 
transgene). The UAS-GFP line was used as a negative control. We counted 
the number of resultant offspring (females, red; males, blue) having 

both actin-GAL4 and UAS (+) and siblings having only UAS as internal 
controls (—). Different lowercase letters indicate statistically significant 
differences (Steel-Dwass test (see Supplementary Table 2)). P< 0.0001 

in all instances except AANK (P< 0.05). NS, not significant (P > 0.05). 
Box plots indicate the median (bold line), the 25th and 75th percentiles 
(box edges), and the range (whiskers). Dot plots show all data points 
individually. The total numbers of adult counts for each genotype and sex 
are shown at the bottom. 
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bacterial effectors that manipulate host cellular processes!*"4, and (ii) 
spaid was located on a putative plasmid’* (Extended Data Fig. 2b), 
like other bacterial virulence factors. Further analysis predicted an 
N-terminal signal peptide for secretion and a C-terminal hydropho- 
bic region (Fig. 1b). The spaid locus in the partial male-killing strain 
contained an 828-bp deletion (Extended Data Fig. 3), resulting in a 
truncated protein lacking the hydrophobic region, as well as a single 
amino acid substitution (Q787C) (AC; Fig. 1b). Of note, this gene 
was not present in an earlier published version of the S. poulsonii 
MSRO genome’® (Supplementary Data), and we found no obvious 
homologous proteins in our BLAST searches. 

To test whether Spaid is responsible for male killing, we used the 
GAL4/UAS system to overexpress the gene as a C-terminal green flu- 
orescent protein (GFP) fusion protein in D. melanogaster. Notably, 
Spaid expression with the actin-GAL4 ubiquitous driver eliminated 
all male offspring, but had no impact on female emergence (Fig. 1c). 
Thus, Spaid kills flies in a sex-dependent manner. 

Male killing in Drosophila induced by S. poulsonii is associated with 
abnormal apoptosis”? and neural disorganization®*” during embry- 
ogenesis. The mechanism of the neural defects is not known, but has 
been suggested to be independent of apoptosis*”. If Spaid is the bona 
fide male-killing factor, its expression in embryos should phenocopy 
the pathology described above. We employed the nanos-GAL4 mater- 
nal driver to express Spaid during early embryogenesis and mon- 
itored apoptosis (Fig. 2; with terminal deoxynucleotidyl transferase 
dUTP nick-end labelling (TUNEL) staining) and neural organization 
(Extended Data Fig. 4; using the neuron-specific marker Elav'’). The 
sex of embryos was determined by antibody staining for the Sex lethal 
(Sxl) protein, which is expressed only in females'®. We found that Spaid 
expression induced strong apoptosis and neural disorganization in 
male but not female embryos (Fig. 2 and Extended Data Fig. 4). The 
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Fig. 2 | Expression of Spaid reproduces male-killing phenotypes 
during embryogenesis. a, b, Representative images of stage 13-14 female 
(a, n= 14) and male (b, n= 16) embryos maternally expressing Spaid, 
stained for apoptosis (TUNEL; green), Sxl (magenta), and DNA (blue). 
Single-channel images of TUNEL and Sxl are also shown. c, Quantification 
of TUNEL signals in stage 11-12 and 13-14 embryos (females, red; males, 
blue). Different characters indicate statistically significant differences 

(P < 0.0001; Steel-Dwass test; see Supplementary Table 2). Box and dot 
plots as in Fig. 1c. Sample sizes (n, number of embryos) are shown at the 
bottom. Embryos were co-stained for Elav, TUNEL, Sxl, and DNA, and 
selected channels are shown in a, b and Extended Data Fig. 4. 


level of apoptosis in male embryos that expressed Spaid increased as 
their development proceeded (Fig. 2c), similar to embryos infected 
with S. poulsonii?. Although numerous cells underwent apoptosis in 
male embryos that expressed Spaid, the neural cells did not appear to 
suffer ectopic cell death (Extended Data Fig. 4c). Thus, the expression 
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Fig. 3 | Spaid acts through the MSL complex. a, b, Epithelial cells in stage 
9 male embryos expressing GFP (a) and Spaid (b), stained for pH2Av 
(green), MSL1 (magenta), and DNA (blue). c, Quantification of pH2Av 
foci in embryos expressing GFP and Spaid. d, Percentage of pH2Av foci 
overlapping with MSL1 signals in male embryos expressing; GFP, 6.3% 
(0-19.3%) (median (interquartile range)) and Spaid, 52.9% (46.2-64.2%). 
e, f, Dividing cells in stage 9 male embryos expressing GFP (e, proper 
segregation) and Spaid (f, a broken bridge) stained for DNA (green) and 
MSLI (magenta). g, The number of chromatin bridges containing (black; 
numbers on the left) or not containing (grey; numbers on the right) 


MSLI signals in embryos expressing GFP and Spaid. h, The number of 
MSL] focal signals in male embryos expressing GFP (black) and Spaid 
(green). The same datasets of stage 8-10 embryos were analysed in 

a-h (n=50 or 48 images per condition). Different lowercase letters or 
asterisks indicate significant differences: c, P< 0.001, Steel-Dwass test; 

d, P< 0.0001, x? test; and h, P< 0.0001, two-tailed Mann-Whitney U test 
(see Supplementary Table 2). Box and dot plots in c, d, h are as in Fig. 1c 
and sample sizes are shown at the bottom in c. All UAS transgenes were 
expressed maternally. 
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of Spaid is sufficient to induce the two different pathologies caused by 
male-killing S. poulsonii. 

In Drosophila (XX female, XY male), the single male X chromo- 
some is hyper-transcribed by twofold to equalize gene expression 
levels between sexes. This dosage compensation system is mediated by 
a protein-RNA complex called the male-specific lethal (MSL) complex, 
which is selectively recruited to the male X chromosome”’. Previous 
studies have uncovered a link between the male-killing action of 
S. poulsonii and the host’s dosage compensation machinery®!°!!. 
Genetic experiments revealed that S. poulsonii fails to kill males lacking 
the MSL components, while it can induce death in females ectopically 
expressing the MSL complex®!®!", This suggests that S. poulsonii either 
targets the MSL complex directly or targets its downstream chroma- 
tin modifications (for example, acetylation of histone H4 at K16)”’. 
Remarkably, we found that the expression of Spaid triggered massive 
apoptosis in transgenic female embryos engineered to express the 
MSL complex (Extended Data Fig. 5a—d), indicating that this toxin 
mediates its effects through the dosage compensation machinery. We 
previously showed that S. poulsonii infection triggers DNA damage 
and segregation defects on the male X chromosome, thereby activat- 
ing male-specific apoptosis!’. To test whether Spaid expression can 
reproduce these phenotypes, we monitored DNA damage on the male 
X chromosome by antibody staining for the phosphorylated histone 
H2Av (pH2Av)”’ and MSL1, a component of the MSL complex label- 
ling the male X chromosome. In control embryos expressing GFP 
and female embryos expressing Spaid, a few H2Av foci were detected, 
whereas male embryos expressing Spaid exhibited numerous bright 
foci (Fig. 3a-c) that frequently overlapped with MSL] signals (Fig. 3d). 
We also found that Spaid expression in male embryos caused the for- 
mation of chromatin bridges, of which 87.1% contained MSL] signals 
(Fig. 3e-g). The MSL1-labelled chromosomes were often fragmented 
and unevenly distributed after mitosis (Fig. 3f; 47.7% of the MSL1- 
containing bridges). In addition, MSL1 signals were reduced in male 
embryos that expressed Spaid, and we observed this phenotype at an 
even earlier time point (Fig. 3h and Extended Data Fig. 5e, f} stages 
8-10) than has been described for embryos infected by S. poulsonii 
(from stage 13 onward)!°. This is probably because the high expression 
of Spaid driven by the GAL4/UAS system results in much more severe 
effects than S. poulsonii infection. Finally, we examined the subcellular 
distribution of Spaid in larval salivary glands, whose large cellular sizes 
facilitate detailed cytological analyses. In both sexes, Spaid-GFP was 
found at plasma membranes, in the cytoplasm, and throughout nuclei, 
but was absent from the nucleolus (Fig. 4a, b). Notably, Spaid-GFP 
was enriched on MSL1-labelled chromosomes in male nuclei (Fig. 4b). 
This result is consistent with its reliance on the dosage compensation 
machinery, although equivalent localization analyses in embryos are 
needed. 

To better characterize the action of Spaid, we made deletion con- 
structs lacking either ankyrin repeats (AANK) or the OTU domain 
(AOTU) (Fig. 1b). Expression of AANK with actin-GAL4 did not kill 
males, pointing to a pivotal role of ankyrin repeats in Spaid activity 
(Fig. 1c). The expression of AOTU with actin-GAL4 still eliminated 
adult males, but they were killed later in development (pupal stage) 
compared with wild-type Spaid (second instar larval stage) (Fig. 1c). 
Furthermore, and in contrast to wild-type Spaid, weaker expression 
of AOTU with armadillo-GAL4 failed to kill males (Extended Data 
Fig. 6), indicating that the OTU domain is required for full activity. 
Notably, although its distribution pattern was otherwise indistin- 
guishable from that of Spaid-GFP in salivary gland cells, AANK- 
GFP was not enriched on MSL-labelled chromosomes (Fig. 4c, d). 
By contrast, AOTU-GFP was only weakly detected within nuclei 
regardless of sex, and colocalization with MSL was not apparent 
(Fig. 4e, f). The male-killing activity of AOTU when overexpressed 
may be caused by excess amounts of protein overriding its localiza- 
tion defect. All of these observations can be reconciled in a model 
of the chromosomal targeting of Spaid in which the OTU domain 
promotes nuclear localization and ankyrin repeats interact with the 
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Fig. 4 | Subcellular localization of Spaid. a-f, Larval salivary glands 
expressing Spaid-GFP (a, female, n = 13; b, male, n= 17), AANK-GFP 

(c, female, n = 9; d, male, n= 16), and AOTU-GFP (e, female, n = 15; 

f, male, nm = 12) stained for MSL1 (magenta) and DNA (blue). For GFP 
(green), raw fluorescence signals were detected. Magnified views of nuclei 
are shown. Dark spots inside nuclei in GFP images represent the nucleolus. 
Arrowheads indicate GFP signals associated with plasma membranes. 
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MSL complex or downstream histone modifications (Extended Data 
Fig. 7). 

In summary, we have discovered an S. poulsonii gene that is likely to be 
responsible for male killing. Notably, expression of this single gene was 
sufficient to recapitulate the phenotypes associated with male killing. 
Our analysis revealed that Spaid is the sole ankyrin-repeat protein in 
the S. poulsonii genome (Supplementary Data). This contrasts with 
Wolbachia, whose genomes encode more than 20 ankyrin-repeat 
proteins”)? Notably, the Wolbachia strain wMel has a prophage- 
associated gene WD0633, that contains ankyrin repeats and the OTU 
domain”. Nevertheless, its transgenic expression in D. melanogaster 
produced no obvious phenotypes in a previous report™*. Recent studies 
discovered Wolbachia genes that cause cytoplasmic incompatibility, 
a reproductive manipulation whereby symbiont-free females are 
unable to reproduce when mated with infected males**-’”. Some of the 
genes that cause cytoplasmic incompatibility along with Spaid contain 
deubiquitinase domains, raising the possibility that host ubiquitination 
pathways are a common target in distinct strategies. Future research 
will focus on the identification of host cellular targets of Spaid, thereby 
deciphering the principles of its sex-specific activity. A thorough 
understanding of the reproductive manipulations induced by 
symbionts would not only provide novel insights into fundamental 
aspects of development, sex determination, and their evolution in 
insects, but could also provide clues to control insect populations. 
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METHODS 


Fly stocks and genetics. Laboratory stocks of D. melanogaster were maintained 
at 25°C with standard cornmeal medium. The fly stocks used in this study were 
cultured in tetracycline-containing medium (0.7—0.8 mg ml‘) for one genera- 
tion to eliminate possible contamination with Wolbachia and Spiroplasma. After 
treatment, about ten females and/or males were checked by diagnostic PCR with 
specific primers for Wolbachia (wsp_81F/691R)** and Spiroplasma (16SA1F/ 
TKSSsp)”?? (Supplementary Table 1). Treated flies were maintained in the nor- 
mal medium for at least two generations before use in experiments. The following 
lines were obtained from the Bloomington Drosophila Stock Center at Indiana 
University (BDSC) and the Department of Drosophila Genomics and Genetic 
Resources at Kyoto Institute of Technology (DGGR): actin5C-GAL4 (actin-GAL4; 
BDSC 4414), nanos-GAL4::VP16 (nanos-GAL4; BDSC 4937), tubulin-GAL80° 
(BDSC 7108), armadillo-GAL4 (BDSC 1560), UASp-eGFP (DGGR 116071), and 
CyO, ActGFP (the green balancer; DGGR 107783). The nanos-GAL4 flies, which 
were found to be Wolbachia and Spiroplasma-free by PCR, were not treated with 
antibiotics, because they became sick after treatment. Oregon-R (used as a wild- 
type line) and msl1'/CyO; H83M2°" flies were generously provided by T. Murata 
and M. Kuroda, respectively. 

To express the Spaid-encoding gene, we used the GAL4/UAS system*” 
(see below). For the zygotic expression of Spaid, actin-GAL4/CyO (Fig. 1c) or 
armadillo-GAL4 (homozygous; Extended Data Fig. 6) flies were crossed to 
homozygous UAS transgenic flies. To determine the lethal phase during larval 
stages, we analysed the number of teeth present in mouth hooks of killed larvae*’. 
For maternal expression*’, UAS females were crossed to nanos-GAL4 males, and 
the resultant female progeny were mated with Oregon-R males (Figs. 2, 3 and 
Extended Data Fig. 4 and 5e, f). For the ectopic formation of the MSL complex in 
females, msl1/°°/CyO, ActGFP; H83M2 flies were used instead of Oregon-R. The 
H83M2 transgene expresses the msl-2 coding sequence (lacking Sxl-binding sites 
and resistant to translational repression) under the control of a heat shock-inducible 
promoter*, but we used its leaky basal expression. The resultant embryos were 
distinguished by GFP staining and only GFPT embryos (wild-type for msl1) were 
analysed (Extended Data Fig. 5a—d). For the expression of Spaid in larval salivary 
glands (Fig. 4), we used a recombined actin-GAL4, tubulin-GAL80°/CyO line to 
avoid male lethality and let them grow until the third instar larval stage. Crosses 
maintained at 18°C for 7-8 days were shifted to 29°C and kept for 1 day before 
dissection. Only GFP* wandering third instar larvae were dissected. 
Construction of transgenic fly lines. For the gene synthesis of the Spaid coding 
sequence by the GeneArt service (Thermo Fisher Scientific), TGA stop codons, 
which encode tryptophan in Spiroplasma, were mutated into TGG. Codon usage 
was optimized for the expression in D. melanogaster. Consequently, GC content 
was modified from 23.3% to 50.4%. We divided the Spaid coding sequence into 
two parts and synthesized them separately: one was a 2,367-bp fragment that corre- 
sponds to Spaid AC and the other was an 838-bp fragment encoding the remaining 
3’ portion of full-length Spaid. To obtain the full-length Spaid coding sequence 
(3,195 bp, without a stop codon), the two fragments were fused using PCR with 
a 30-bp overlap and cloned into the pENTR vector using the pENTR/D-TOPO 
cloning kit (Thermo Fisher Scientific). To generate Spaid deletion constructs, 
we amplified two PCR fragments (nucleotide positions 1-276 and 787-3195 for 
AANK; nucleotide positions 1-789 and 1396-3195 for AOTU) from the synthetic 
Spaid coding sequence, fused them using PCR with a 24-bp overlap and cloned 
the product into the pENTR vector. We used PrimeSTAR HS DNA Polymerase 
(Takara Bio) for all PCR reactions. The Gateway cassette containing the Spaid cod- 
ing fragments was transferred into the pPWG destination vector (The Drosophila 
Genomics Resource Center 1078; The Drosophila Gateway Vector Collection) by 
the LR clonase II enzyme mix kit (Thermo Fisher Scientific) to construct pUASp- 
Spaid-eGFP, pUASp-Spaid. AANK-eGEP, and pUASp-Spaid. AOTU-eGFP plas- 
mids. Transgenic fly lines were generated by the standard microinjection method 
for P-element transformation (BestGene). 

Identification and characterization of the partial male-killing S. poulsonii 
strain. Male-killing S. poulsonii can be easily transferred to other fly stocks by 
haemolymph injection”. S. poulsonii containing haemolymph was collected from 
a naturally infected D. melanogaster line Ug-SR provided by J. Jaenike**, and trans- 
ferred to wild-type Oregon-R flies that were used as the source of haemolymph for 
subsequent artificial infection. We established two artificially infected D. melano- 
gaster stocks, Df(3L)H99 (DGGR 106395) and Sxl-eGFP (BDSC 24105) in 2012 
(Extended Data Fig. 1a). These infected stocks had shown perfect male killing (no 
emergence of adult males) for a period of at least 20 generations after the establish- 
ment of stable infection’. Afterwards, however, the infected Sxl-eGFP line started 
to produce male escapers, while the Df(3L)H99 line showed complete male killing. 
We collected haemolymph from these fly stocks and injected them into Oregon-R 
flies to exclude the effect of host genetic background. We confirmed that only the 
S. poulsonii strain collected from the Sxl-eGFP line showed partial male killing 
(Extended Data Fig. 1b, c). The male-killing S. poulsonii strain in D. melanogaster is 


conventionally designated as MSRO (melanogaster sex ratio organism). Therefore, 
we call these S. poulsonii strains MSRO-H99 (the complete male killer as a control) 
and MSRO-SE (the partial male killer) after the genotypes of the host fly lines from 
which they were derived (Df(3L)H99 and Sxl-eGEP, respectively). We also refer 
to the original male-killing strain from the Ug-SR line as MSRO-Ug according to 
the previous study”. 

We suspect that the partial male-killing strain might have resulted from 
genome rearrangements as reported in other Spiroplasma species maintained in 
the laboratory*®. Repeat-rich sequences like viral fragments are often associated 
with genomic instability of Spiroplasma*’, and accordingly, the spaid gene of the 
partial male-killing strain seemed to be truncated by rearrangements between 
genes containing repetitive sequences (for example, integrase and p58; Extended 
Data Fig. 3b). To keep the partial male-killing strain, we used aged females (7-11 
days) harbouring high bacterial titres to prevent the loss of the symbiont during 
vertical transmission. Because the titre of the partial male-killing strain appears 
to be higher than the original strain in some aged females, this may have led to 
the propagation of mutant bacteria and promoted their transmission (Extended 
Data Fig. 1d). 

Analysis of S. poulsonii titres by quantitative PCR. Oregon-R virgin females 
infected with MSRO-Ug, MSRO-H99, and MSRO-SE were collected and aged for 
0 (virgin), 7 or 14 days and individually transferred to a 1.5-ml tube and stored at 
—80°C. They were then homogenized with plastic pestles by hand, and genomic 
DNA was purified using the DNeasy Blood & Tissue Kit (Qiagen). A portion of the 
S. poulsonii dnaA gene and the D. melanogaster RpS17 gene were amplified with 
specific primer sets (dnaA_109F/246R** and rps17_615F/695R®, respectively) 
(Supplementary Table 1) under the following PCR conditions: 95°C for 5 min 
and then 45 cycles of 95°C for 10s, 55°C for 20s and 72°C for 20s. The 10-l 
PCR mixture contained 21] of genomic DNA, 1 x LightCycler 480 SYBR Green 
I master mix (Roche) and 0.54.M primers. Quantitative PCR was performed by 
LightCycler 480 II (Roche). Cp values were obtained by the second derivative maxi- 
mum method and the values of technical duplicates were averaged. We excluded 
samples if their duplicates had a difference in Cp values >0.5 cycles or if their Cp 
values were more than 30 cycles (2 out of 63 samples in total). To calculate the 
relative copy number of S. poulsonii dnaA gene against the host RpS17 gene, we 
followed the Pfaffl method”. To estimate the PCR efficiency of each primer set, 
we performed quantitative PCR by using six tenfold serial dilutions of genomic 
DNA purified from ten female adult flies infected with MSRO-Ug, aged for 4-6 
days after eclosion. PCR efficiency values were 82% (dnaA_109F/246R) and 98% 
(rps17_615F/695R), respectively. 

Whole-genome sequencing of male-killing S. poulsonii. To collect a sufficient 
amount of DNA for whole-genome sequencing of S. poulsonii strains MSRO-H99 
and MSRO-SE, we recovered fly haemolymph by a previously developed centrifu- 
gal separation method"! with minor modifications. We first prepared a 0.5-ml poly- 
propylene tube (Sarstedt, 72.704) whose bottom has a slit made by a sharp blade 
(hereafter called cartridge). Under CO) anaesthesia, we pricked the preepisternal 
area of the thorax of infected female adults and put them into the cartridge. The 
cartridge containing 30-40 flies was inserted into a 1.5-ml tube and centrifuged 
at 5,000 r.p.m. (2,300g) for 5 min at 4°C. We usually made between three and 
ten cartridges at one time and they were kept on ice until centrifugation. After 
removing cartridges, we checked precipitates that contained host haemolymph and 
S. poulsonii under a stereomicroscope, and confirmed that there were no large 
debris (embryos and carcasses etc.). The tubes were washed with 40011 PBS to 
resuspend and merge all precipitates. The merged suspension was filtered with 
a 0.65-1m pore size membrane filter (Ultrafree- MC; Merck, UFC30DV00) by 
centrifugation at 12,000g for 3 min at 4°C to eliminate large microorganisms 
including yeasts. The flow-through was centrifuged at maximum speed for 
another 10 min and the supernatant was removed. Remaining bacterial pellets 
were stored at —80°C. After suspending the bacterial pellet in 180 11 buffer ATL 
(Qiagen), 2011 proteinase K solution (Qiagen) was added and pulse-vortexed 
for 10s, and the mixture was then incubated for 1h at 56°C. Then, 411 RNase 
A (100 mg mI !; Qiagen) was added and treated for 5 min at room temperature. 
Genomic DNA was purified by the conventional phenol-chloroform extraction 
method (see PacBio Shared Protocol online, https://www.pacb.com/wp-content/ 
uploads/2015/09/SharedProtocol-Extracting-DNA-usinig-Phenol-Chloroform. 
pdf). During the extraction, samples were mixed by pulse-vortexing within 20s 
to prevent shearing of high molecular weight DNA. We collected haemolymph 
from 1,843 (MSRO-H99) and 1,378 (MSRO-SE) adult females to recover 18.9 .g 
and 7.56\1g genomic DNA, respectively (quantified by NanoDrop 1000; Thermo 
Fisher Scientific). 

The genomic DNA was purified with Agencourt AMPure XP beads (Beckman 
Coulter) and was sheared in a Covaris g- TUBE (Covaris) to obtain 20-kb frag- 
ments. After shearing, the size distribution of DNA was checked by the Fragment 
Analyzer (Advanced Analytical Technologies). Sheared DNA (41g, MSRO-H99; 
51g MSRO-SE) was used to prepare a SMRTbell library with the PacBio SMRTbell 
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Template Prep Kit 1.0 (Pacific Biosciences) according to the manufacturer’s 
recommendations. The resulting library was size selected on the BluePippin system 
(Sage Science) for molecules larger than 16 kb (MSRO-H99) or 18kb (MSRO-SE). 
The recovered library bound to MagBeads was sequenced on a single SMRT Cell 
with P6/C4 chemistry per genome by the PacBio RS II system (Pacific Biosciences) 
at 240-min movie length. Genome assembly was performed with the HGAP 
(Hierarchical Genome Assembly Process) software (Pacific Biosciences) version 
3 and version 2 for the MSRO-H99 and MSRO-SE genomes, respectively. Library 
preparation, whole-genome sequencing and genome assembly were performed in 
the Lausanne Genomic Technologies Facility (GTF) at the University of Lausanne 
(UNIL). 

Genomic data analyses and protein domain searches. Genomic data analysis 
was performed on the Bio-Linux 8 platform (NERC Environmental Omics)”. 
The genome sequences of MSRO-H99 and MSRO-SE assembled into five and two 
contigs, respectively (Extended Data Fig. 2a), and were aligned with the previously 
published MSRO genome’ and ordered using Mauve v.2.4.0*4 (Extended Data 
Fig. 2b). For the MSRO-H99 genome, three major contigs (1-3) were assigned 
to the main chromosome, while the remaining two minor contigs (4 and 5) were 
assigned to extra chromosomes. Two contigs of the MSRO-SE genome were 
assigned to the main chromosome (contig 1; circularized) and the extra chro- 
mosome (contig 2). These extra chromosomes could be plasmids because they 
contain several proteins, such as Soj (the chromosome partitioning protein) and 
ARPs (adhesion-related proteins; P58, P12, P54, P123 and P18), which are located 
on plasmids in other Spiroplasma species”. 

Whole-genome annotation was conducted by using Prokka v.1.11*°. We cre- 
ated a custom annotation database by combining published genomic sequences 
of several Spiroplasma species (S. citri CII3-3X (AM285301)”, S. melliferum KC3 
(AGBZ02000000)*, S. melliferum IPMB4A (AMGI01000000)”, S. chrysopicola 
DF-1 (CP005077)*, S. poulsonii MSRO (NZ_JTLV00000000)"*, S. kunkelii CR2-3X 
(CP010899)*°; the numbers in parentheses represent GenBank accession numbers). 

To identify conserved protein domains including ankyrin repeats, annotated 
protein sequences were analysed using the NCBI Conserved Domain Database 
(NCBI CDD)°>!*? with default parameters (Supplementary Data). Besides 
spaid (SMH99_ 26490; 3,198 bp), other genes containing the OTU domain 
(SMH99_25890; 597 bp) and ankyrin repeats (SMH99_25900; 204 bp) were pre- 
dicted on an identical contig (contig 4; 87,892 bp) in MSRO-H99. It is likely that 
they are derived from the misassembly or partial assembly of Spaid-encoding reads, 
because they are located close to the end of the contig (from 4,133 to 5,048 bp) 
with much lower coverage (below 10x) compared to the spaid locus (from 51,206 
to 54,403 bp) with high coverage (over 330). These genes were not found in 
MSRO-SE. Further domains were predicted using the InterPro protein sequence 
analysis and classification database*’. The protein domain structure of Spaid was 
drawn using the Illustrator for Biological Sequences (IBS) software v 1.0.2°4 and 
modified manually. 

Homology searches. For homology searches, we used the protein BLAST (blastp) 
program on the NCBI BLAST website with a non-redundant protein sequence 
database (nr). After searching the entire database without specifying the organism, 
we also searched bacteria, viruses, and Drosophila databases separately; however, 
we found no protein sequences that aligned to the entire Spaid protein sequence. 
The top hit having the highest score was a hypothetical protein of Spiroplasma 
kunkelii (WP_053391598), which covers the portion of the N- and C-terminal 
sequences of Spaid (amino acid positions 1-49 (score, 45.4; E-value, 0.71; identi- 
ties, 48.98%; positives, 69.39%; gaps, 14.29%) and 704-1,065 (score, 385; E-value, 
2.63 x 10° |'; identities, 58.03%; positives, 69.63%; gaps, 14.07%), respectively). 

Sanger sequencing of the spaid locus. Ten adult Oregon-R females that were 
uninfected (as a negative control) or infected with MSRO-Ug, MSRO-H99, and 
MSRO-SE were collected and genomic DNA was purified using the method 
described above for quantitative PCR. The 3’ portion of the spaid gene was ampli- 
fied with forward and reverse primer sets (spaid_1568F with spaid.L_+136R for 
MSRO-Ug and MSRO-H99 and spaid.S_ +362R for MSRO-SE, respectively) 
(Supplementary Table 1) under the following PCR conditions: 95°C for 2min 
then 30 cycles of 95°C for 30s, 50°C for 30s and 60°C for 3 min. Owing to the 
low GC content of the spaid gene (23.3%), extension at 60°C instead of 72°C 
was essential for successful PCR amplification®*. The 50-1 PCR mixture con- 
tained 21] of genomic DNA, 1.25 U of GoTaq G2 DNA polymerase (Promega), 1 x 
green reaction buffer (Promega), 0.5,1M primers, and 0.2mM dNTP mixture. The 
PCR fragment was purified from agarose gel using the Wizard SV Gel and PCR 
Clean-Up System (Promega) and read by direct sequencing. To design primers, 
we referred to the genome sequences assembled from the PacBio data. No distinct 
PCR amplification was detected in uninfected Oregon-R female flies, confirming 
that the spaid gene is not encoded by the D. melanogaster genome but by the 
S. poulsonii genome. 

Staining and imaging. To collect embryos, we put female and male flies in culture 
vials and waited for two days. These adults were transferred to egg-collection cages 
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with grape-juice agar plates and fed with yeast paste. Developmental staging of 
embryos was followed refs. 56,57. Embryos were collected from grape-juice agar 
plates and dechorionated in 2.8% sodium hypochlorite solution, they were sub- 
sequently fixed in 1:1 mixture of heptane and 4% paraformaldehyde (EM Grade; 
Electron Microscopy Sciences, 15710) diluted in PBS for 20 min at room temper- 
ature, and devitellinized by vigorously shaking in heptane/MeOH. The embryos 
were washed in MeOH three times and rehydrated through an EtOH series (95%, 
70%, 50% and 35%), and then washed in PBT (PBS containing 0.1% Triton X-100). 
After treated with a blocking buffer (PBT containing 2% bovine serum albumin 
(BSA, Fraction V; MP Biomedicals, 02160069)) for 60 min, the embryos were incu- 
bated with primary antibodies at 4°C overnight, washed three times in PBT and 
incubated with secondary antibodies at room temperature for 90 min. Antibodies 
were diluted in the blocking buffer. Anti-Sxl and anti-MSL1 antibodies were used 
for sexing embryos (see below). 

The following primary antibodies were used: mouse anti-Sxl (1:20; 
Developmental Studies Hybridoma Bank (DSHB), M18)°*, mouse and rabbit 
anti-MSL1 (1:200 and 1:500; kindly provided by J. Lucchesi), rabbit anti-histone 
H2AvD pS137 (pH2Av; 1:1,000; Rockland Immunochemicals, 600-401-914), 
chicken anti-GFP (1:500; Aves Labs, GFP-1020), rat anti-Elav (1:20; DSHB, 
7E8A10)*. Secondary antibodies (1:1,000; Alexa Fluor 488/555/647 conjugate) 
were purchased from Molecular Probes (Thermo Fisher Scientific). DNA stain- 
ing was carried out with DAPI (1g ml~'; Sigma, D9542) for 10 min at room 
temperature after secondary antibody staining. TUNEL staining was performed 
by using the In situ Cell Death Detection Kit, TMR red (Roche), and the embryos 
stained with primary antibodies were incubated in 5011 TUNEL reaction mixture 
with secondary antibodies at 4°C overnight. Stained embryos were washed three 
times in PBT, mounted in FluorSave Reagent (Calbiochem) and observed under a 
confocal microscope (Zeiss LSM 700). 

To detect S. poulsonii in host haemolymph (Fig. 1a), the abdomen of an adult 
female was dissected in 511 PBS with tweezers on a microscope slide. One microli- 
tre of SYTO 9 nucleic acid stain solution (0.02 mM; Thermo Fisher Scientific) was 
added and observed under a widefield microscope equipped with a CCD camera 
(Zeiss Axio Imager Z1/AxioCam MRm). 

To monitor the subcellular distribution of the GFP fusion proteins of Spaid, we 

dissected out salivary glands from wandering third instar larvae and fixed them in 
4% paraformaldehyde diluted in PBS for 15 min at room temperature. Blocking and 
staining were performed as described above. For GFP, we detected raw fluorescent 
signals without antibody staining. 
Image analysis and processing. Confocal z-sections were max projected using the 
Fiji software package® with a custom macro. Image analysis was performed using 
custom R scripts and the EBImage package". For the quantification of TUNEL 
signals of whole embryos (Fig. 2c and Extended Data Fig. 5d) (acquired using a 
20x/0.8 objective with 0.6 x zoom scan; frame size: 512 x 512; 1.541m, two times 
optimal intervals), maximum projection images of DAPI and TUNEL staining were 
binarized and the former was used to make an embryonic mask image. TUNEL 
signals inside a corresponding mask image were measured by image integration. 
This value was divided by mask image area to normalize the embryonic size. 

For the quantification of focal pH2Av signals (Fig. 3c), embryos from stages 
8-10 were triply stained (pH2Av, MSL1, and DAPI) and two images were acquired 
per embryo (using a 63 x/1.4 oil immersion objective; frame size, 1,024 x 1,024; 
0.3 |4m optimal intervals). We compiled 20 serial z-sections from the top (from 25 
sections in total) to make projected images of pH2Av and MSLI signals. These 
images were smoothed using a Gaussian filter and binarized using the moving 
average method, respectively. To identify focal pH2Av signals, image objects 
were extracted from the stacked images by segmentation and labelling (objects 
that were smaller than 20 pixels were eliminated to remove noise and ambiguous 
signals). To calculate the enrichment of focal pH2Av signals on the MSL1-labelled 
chromosome (Fig. 3d), overlaps between focal pH2Av signals and MSLI signals 
were obtained by image integration. To quantify the number of MSL] signals with 
discrete focal shapes (Fig. 3h), objects over 30 pixels were extracted and counted. 
The data sets described above were also used to count the number of chromatin 
bridges (Fig. 3g). The 25 serial z-slice images were visually inspected to detect 
chromatin bridges in the Fiji software. 

The brightness and contrast of the presented images in the manuscript were 
adjusted using the level tool in the GIMP 2.8 software package. The adjustment 
was performed uniformly on the entire images and only black/white input levels 
were modified. 

Statistics and reproducibility. No statistical methods were applied to predeter- 
mine sample size. The experiments were not randomized. The investigators were 
not blinded to allocation during experiments and outcome assessment. We used the 
R v3.4.1 software package (the R Foundation) for all statistical analyses. Multiple 
comparisons in Figs. 1c, 2c, 3c and Extended Data Fig. 1d, 5d were performed 
using the Steel-Dwass test of the pSDCFlig function in the NSM3 R package”. 
We performed the x? test in Fig. 3d and the Mann-Whitney U test (two-tailed) 
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in Fig. 3h and Extended Data Fig. 6. P-values less than 0.05 were considered as 
significant. Exact P-values are listed in Supplementary Table 2. For the zygotic 
expression of spaid and its deletion constructs (Fig. 1c and Extended Data Fig. 6), 
two independently established UAS transgenic lines were used to verify the repro- 
ducibility of the results. Other experiments were independently repeated at least 
two times, except for the qPCR analysis in Extended Data Fig. 1d (one experiment 
with 6-7 biological replicates). 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. Whole genomic sequence data have been deposited at GenBank 
under the BioProject number PRJNA416288. Sequence data for synthetic gene 
fragments have been deposited at GenBank under the accession numbers 
MG837001 and MG837002. Source Data for Figs. 1, 2, 3 and Extended Data Fig. 1, 
5, 6 are available in the online version of the paper. All relevant data supporting 
the findings of this study are included within the article and its Supplementary 
Information Files or are available from the corresponding authors upon reason- 
able request. 
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Extended Data Fig. 1 | Identification and characterization of the partial 
male-killing S. poulsonii strain. a, An illustration showing the origin of 
the S. poulsonii strains analysed in this study. Pictures show male-killing 
(MK) S. poulsonii of D. melanogaster. MSRO-Ug is the original male- 
killing strain maintained in the Oregon-R wild-type fly. Fly stocks (Df(3L) 
H99 and Sxl-eGFP) artificially infected with this original strain showed 
complete male killing (100% MK) for the first 20 generations. Afterwards, 
one strain (MSRO-SE) started to show the partial male-killing phenotype, 
whereas the other (MSRO-H99) kept the ability to induce complete male 
killing. See Methods section ‘Identification and characterization of the 
partial male-killing S. poulsonii strain for more detail. b, c, Sex-ratio 
analysis of the adult progeny obtained from Oregon-R flies infected 

with MSRO-Ug, MSRO-H99, and MSRO-SE. We repeated experiments 
three times on the fourth, fifth and sixth generations (G4-6) after the 
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establishment of infection. In c, the relative number of male offspring 
(percentage of females) obtained from Oregon-R female flies infected with 
MSRO-SE are plotted. Data points were excluded if the total count of flies 
was below 10. d, Relative titre of S. poulsonii within individual female flies. 
Adult females infected with three MSRO strains were kept for 0, 7 and 

14 days after eclosion and analysed by qPCR. Data were normalized with 
respect to females from day 0 that were infected with MSRO-Ug. Different 
letters indicate statistically significant different groups (P< 0.01; N.S., 

not significant, P > 0.05; Steel-Dwass test; see Supplementary Table 2). 
Please note that the titres of the three strains are comparable and even the 
higher titre in old females (see 14 days in d) fails to induce complete male 
killing in MSRO-SE (c). Box and dot plots are as in Fig. 1c and sample sizes 
(n, number of adult flies) are shown at the bottom. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


. MSRO, 2015 MSRO-H99 MSRO-SE 
Reference Paredes et al., 2015 This work This work 
Male killing Complete Complete Partial 
Contigs 12 5 2 
Largest contig (bp) 504,367 1,417,292 1,883,572 

Contig 1 - 1,417,292 1,883,572 

Contig 2 - 333,653 55,268 

Contig 3 - 257,938 - 

Contig 4 - 87,892 - 

Contig 5 - 34,188 - 

Total length (bp) 1,771,859 2,130,963 1,938,840 

GC (%) 27 26 26 

N50 179,219 1,417,292 1,883,572 

N75 155,942 333,653 1,883,572 

L50 3 1 1 

L75 6 2 1 

Total genes 1,976 2,749 2,516 

CDS 1,942 2,715 2,482 

tRNA 31 31 31 

rRNA 3 3 3 

b 
500 kb 1000 kb 1500 kb 
MSRO, 2015 500 kb 1000 kb 1500 kb a= 


2000 kb 


Contig 1.1 


MSRO-H99 oe 


1000 kb 


Contig 2 Contig 3 


1500 kb 


MSRO-SE 


Extended Data Fig. 2 | Whole-genome sequencing studies of the male- 
killing S. poulsonii variants. a, Comparison of the genomic features of 
three S. poulsonii strains. MSRO-H99 and MSRO-SE are newly obtained 
variants isolated in this study (Extended Data Fig. 1a). As a control, data 
from the previously reported male-killing S. poulsonii genome’ are also 
indicated. b, Whole-genome alignment of the three S. poulsonii strains. 
To start the alignment from the dnaA gene, contig 1 of MSRO-H99 was 


split into two fragments (contig 1.1 and 1.2). The locations of the contigs 
corresponding to extra chromosomes (putative plasmids; see Methods 
section ‘Genomic data analyses and protein domain searches’) are shown 
as ‘extra. spaid (gene ID from GenBank BioProject PRJNA416288: 
SMH99_26490) and spaid AC (gene ID: SMSE_25110) are located on 
these extra chromosomes in MSRO-H99 (contig 4) and MSRO-SE 
(contig 2), respectively. 
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Extended Data Fig. 3 | Genetic alterations of the spaid locus in the 
partial male-killing S. poulsonii strain. The genome structures around 
the spaid loci in the male-killing (a, MSRO-Ug and MSRO-H99) and the 
partial male-killing (b, MSRO-SE) S. poulsonii strains. Genes encoded 
on opposite strands are shown in different colours (red and blue, 
respectively). An 828-bp deletion and nucleotide substitutions (coloured 
in red; corresponding amino acid sequences are presented in one-letter 
code) in the 3’ region of the spaid gene are indicated. These sequence 
alterations were confirmed using Sanger sequencing (see Methods). 
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Extended Data Fig. 4 | Neural defects of Spaid-expressing embryos. 
Representative images of stage 13-14 female (a, n= 14) and male 
(b, n= 16) embryos maternally expressing Spaid and stained for TUNEL 
(green) and neural cells (Elav, magenta). Single-channel images of Elav are 
also shown. The outlined region in b is magnified in c with single-channel 
images of Elav and TUNEL. Embryos were co-stained for Elav, TUNEL, 
Sxl and DNA, and selected channels are shown in a-c and Fig. 2a, b, 
respectively. 
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expressing the MSL complex (H83M2 transgene), stained for TUNEL of embryos) are shown at the bottom. e, f, Representative images of 
(green) and Sxl (magenta). GFP-expressing control female (a, n= 15), epithelial cells of stage 8-10 male embryos expressing GFP (e, n = 25) and 
Spaid-expressing female (b, n= 20), and male (c, n= 19) embryos are Spaid (f, n = 25), stained for DNA (green) and MSL1 (magenta) from the 
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stages 13-14. Different characters indicate significantly different groups 
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Extended Data Fig. 6 | Expression of Spaid using a weak GAL4 driver. 
The number of adult progeny (females, red; males, blue) obtained from 
crosses between the armadillo-GAL4 driver line (weak and ubiquitous 
expression) and four UAS transgenic lines (GFP, Spaid, AANK, and 
AOTU; n=6 independent crosses for GFP and Spaid, n = 8 independent 
crosses for AANK and AOTU). The UAS-GFP line was used as a negative 
control. With this weak GAL4 driver, Spaid still eliminated all male 
progeny, while both AANK and AOTU had no impact on male viability. 
An asterisk indicates the statistically significant difference (P < 0.01; 
N.S., not significant, P > 0.05; two-tailed Mann-Whitney U test; see 
Supplementary Table 2). Box and dot plots are as in Fig. 1c. The total 
numbers of adult counts for each genotype and sex are shown at the 
bottom. 
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Extended Data Fig. 7 | A proposed model for Spaid-induced male- 
killing phenotypes. Spaid utilizes the OTU domain and ankyrin repeats 
(ANK) to target the host nucleus and the male X chromosome. ‘MSL 
and ‘Ac indicate the dosage compensation complex and resultant histone 
acetylation, respectively. See text for other explanations. 
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Chromatin analysis in human early development 
reveals epigenetic transition during ZGA 
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Upon fertilization, drastic chromatin reorganization occurs during 
preimplantation development!. However, the global chromatin 
landscape and its molecular dynamics in this period remain largely 
unexplored in humans. Here we investigate chromatin states in 
human preimplantation development using an improved assay for 
transposase-accessible chromatin with high-throughput sequencing 
(ATAC-seq). We find widespread accessible chromatin regions 
in early human embryos that overlap extensively with putative 
cis-regulatory sequences and transposable elements. Integrative 
analyses show both conservation and divergence in regulatory 
circuitry between human and mouse early development, and 
between human pluripotency in vivo and human embryonic stem 
cells. In addition, we find widespread open chromatin regions before 
zygotic genome activation (ZGA). The accessible chromatin loci are 
readily found at CpG-rich promoters. Unexpectedly, many others 
reside in distal regions that overlap with DNA hypomethylated 
domains in human oocytes and are enriched for transcription 
factor-binding sites. A large portion of these regions then become 
inaccessible after ZGA in a transcription-dependent manner. 
Notably, such extensive chromatin reorganization during ZGA is 
conserved in mice and correlates with the reprogramming of the 
non-canonical histone mark H3K4me3, which is uniquely linked 
to genome silencing*°. Taken together, these data not only reveal a 
conserved principle that underlies the chromatin transition during 
mammalian ZGA, but also help to advance our understanding of 
epigenetic reprogramming during human early development and 
in vitro fertilization. 

We sought to investigate accessible chromatin in human preimplan- 
tation embryos using ATAC-seq’. Our optimized ATAC-seq protocol 
(miniATAC-seq; Methods) can acquire high-quality data using as 
few as 20 cells (Extended Data Fig. la-c). We collected two-cell and 
eight-cell human embryos, inner cell masses (ICMs) from embryonic 
day 5 (E5) human blastocysts, and human embryonic stem (ES) cells 
(Extended Data Fig. 2a). The isolated ICMs are post lineage segregation 
and probably include epiblast and primitive endoderm cells (Extended 
Data Fig. 2b). We also performed RNA sequencing (RNA-seq) analysis 
for each stage and validated these data (Extended Data Fig. 2c, d). 
We then conducted two replicates of ATAC-seq for each stage, and 
an additional two replicates using triploid embryos (derived from 
zygotes with three pronuclei, or 3PN), which are more available dur- 
ing in vitro fertilization (IVF). All replicates showed highly consistent 
results (Fig. 1a, Extended Data Fig. 3a, b and Supplementary Table 1). 
We identified 22,977, 40,426, 44,907 and 46,172 ATAC-seq peaks for 
the two-cell embryos, eight-cell embryos, ICMs and ES cells, respec- 
tively, with comparable genome coverages to that for ES cell DNase 
sequencing (DNase-seq; Extended Data Fig. 3c). ICM ATAC-seq peaks 


have the highest overlap with ES cell DNase I hypersensitive sites, and 
the overlap decreased in early-stage embryos (Extended Data Fig. 3d). 
NANOG is induced at the eight-cell stage, which coincides with ATAC- 
seq enrichment at promoters and nearby putative enhancers (Fig. 1b). 
POUSF1 is slightly upregulated at the eight-cell stage and is highly 
activated in the ICMs and ES cells (Fig. 1b). Notably, ATAC-seq peaks 
near POUS5F1 transit from intronic regions to promoter upstream loci 
from eight-cell human embryos to ES cells, indicating a likely switch 
of regulatory elements between naive and primed pluripotency, as 
in the mouse®. Concordant gene expression and ATAC-seq was also 
found for ZSCANS5B (Extended Data Fig. 3e). Genome-wide, as human 
major ZGA starts between the four- and eight-cell stages’, we identified 
stage-specifically expressed genes and examined their promoter 
accessibility (Extended Data Fig. 4a). We found promoters with con- 
stantly high accessibility that are preferentially CpG-rich as reported 
previously®, whereas promoters with constantly low accessibility are 
generally CpG-poor. Interestingly, a group of genes showed dynamic 
promoter accessibility that correlates with gene expression. These genes 
preferentially function in development, differentiation and morpho- 
genesis, as also evidenced by their histone 3 Lys27 trimethylation 
(H3K27me3) enrichment in ES cells and fibroblasts (Extended Data 
Fig. 4b, c). Therefore, promoter accessibility in early human develop- 
ment correlates with both gene activities and CpG densities (Extended 
Data Fig. 4d). 

Next, we sought to investigate distal open chromatin loci. Certain 
repeats are highly active in mammalian preimplantation embryos”. 
In contrast to ES cells and somatic cells, distal ATAC-seq peaks pref- 
erentially enrich for repeats at the eight-cell stage, especially for Alu, 
SINE and long-terminal repeat (LTR) retrotransposons (Extended Data 
Fig. 5a). ERVK, SVA and ERV1 are enriched at both the eight-cell and 
ICM stages. Both ERVK and SVA are expressed in human preimplan- 
tation embryos”. Thus, accessible chromatin is extensively shaped by 
transposable elements upon human ZGA. As distal accessible chromatin 
frequently harbours transcription factor-binding sites, we attempted 
to infer key factors that may regulate transcription circuitry in early 
development. We first identified stage-specific distal ATAC-seq peaks 
(Extended Data Fig. 5b). Eight-cell specific peaks are generally present 
near genes involved in RNA localization and processing. By contrast, 
distal peaks in ES cells, but not those in the ICM, are preferentially asso- 
ciated with genes involved in SMAD/activin signalling and stem-cell 
regulation. Notably, activin A is a core signalling module that maintains 
primed ES cells but destabilizes naive pluripotency"”. Interestingly, we 
also observed widespread open chromatin in the pre-ZGA two-cell 
embryos (Extended Data Fig. 5b). However, unlike in other stages, their 
predicted targets (Methods) generally lack transcripts at the two-cell 
stage. It is possible that pre-ZGA accessible elements bind transcription 
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Fig. 1 | Accessible chromatin landscape in human preimplanation 
embryos. a, The UCSC browser view showing ATAC-seq signals in 
human early embryos (two biological replicates; rep 1 and rep 2). b, The 
UCSC browser views and heat maps showing ATAC-seq enrichment 
near representative genes and their expression levels detected by RNA- 
seq. Open chromatin regions are shaded. FPKM, fragments per kilobase 
of transcript per million mapped reads. c, Transcription factor motifs 


factors, but do not serve as conventional enhancers to regulate nearby 
genes (discussed later). Next, HOMER"! analysis unveiled highly 
stage-specific transcription factor motifs in distal peaks (Fig. 1c). For 
example, ‘maternal transcription factors’ (two-cell stage and onwards) 
includes CTCF, KLF and OTX2. ‘Minor ZGA transcription factor 
(two-cell stage) includes DUX4, a key activator of early ZGA genes”. 
‘Major ZGA transcription factor’ (eight-cell and onwards) includes 
GSC, PITX1, TFAP2C and GATA factors. ‘Pluripotency transcription 
factors’ (ICMs and ES cells) contains SOX2 and POUS5F1. Notably, 
although conventional ES cells are typically derived from ICMs, they 
are generally in a primed pluripotent state!°. However, ES cells could 
also be maintained in a naive state that corresponds to an earlier 
developmental stage’”. Therefore, we performed ATAC-seq for naive 
human ES cells'*’4 (Methods). Interestingly, both KLF and TFAP2C 
are strongly enriched in ICMs and naive ES cells but not primed ES 
cells (Fig. 1c). KLF factors are known to have crucial roles in naive 
pluripotency!°!°. TFAP2C is a key germline transcription factor and 
also a regulator for trophectoderm in mice’®. Interestingly, TFAP2C 
is highly expressed in all lineages in human blastocysts (Extended 
Data Fig. 5c), suggesting human-specific functions. Finally, putative 
enhancer usages near POUS5F1 in naive ES cells are also more similar 
to the ICM than primed ES cells (Extended Data Fig. 5d). Notably, 
both naive and primed ES cells differ from the ICM as they lack the 
GATA factors that regulate the primitive endoderm lineage'’. These 
data indicate that the regulatory network of naive human ES cells is 
indeed more similar to that of ICMs. 

We then compared human and mouse transcriptional programs 
using previous results or ATAC-seq datasets from mouse early 
embryos'®, mouse ES cells (naive)'® and mouse epiblast stem cells 
(EpiSCs; primed)!°. CTCE KLE SOX2, POU5F1, GATA and TEAD are 
all well conserved between human and mouse (Fig. 1d). We also found 
species-specific transcription factors for preimplantation development 
including GSC, OTX2 and PITX1 (human) and ESRRB, NR5A2 and 
RARG (mouse)'® (Fig. 1d). Notably, GSC (mesendoderm), OTX2 (neu- 
ronal lineage) and PITX1 (limb development) are all lineage regulators. 


identified from distal ATAC-seq peaks at each stage in human embryos 
(two-cell, eight-cell and ICM) and human ES cells (primed, naive 1'* and 
naive 21%). d, Schematic representation of putative regulatory transcription 
factors in early development for those shared by human and mouse 

as well as those specific to each species. Only enriched and expressed 
transcription factors are shown. 


KLF motifs are again enriched in ICMs and mouse ES cells but not in 
mouse EpiSCs. Hence, human and mouse have both conserved and 
divergent regulatory circuitry for early development. 

We also observed widespread open chromatin at the two-cell stage 
(Fig. la). To determine whether this may be associated with minor 
ZGA’, we identified genes (n = 1,018) with increased transcript levels 
at the two-cell stage compared to metaphase II (MII) oocytes. 
Unexpectedly, most of these transcripts are also present in germinal 
vesicle oocytes in our study and also in MII oocyte using a total RNA- 
seq dataset”” (Fig. 2a, left and 2b). Importantly, the eight-cell-specific 
genes, but not the two-cell-specific genes, are sensitive to the tran- 
scription inhibitor «-amanitin (Extended Data Fig. 6a and Methods). 
Consistently, two-cell-specific genes on average only show basal levels 
of promoter ATAC-seq enrichment (Extended Data Fig. 6b). Shortening 
poly(A) tails in oocytes is thought to prevent maternal mRNA trans- 
lation without triggering destabilization”’. Thus, post-transcription 
regulation, such as deadenylation in oocytes and polyadenylation in 
early embryos, may account for a large portion of the differences between 
mRNA-seq data (this study and earlier studies?*?) and total RNA- 
seq”’. After excluding maternal genes (expressed in germinal vesicle 
or MII oocytes), we were able to identify 75 potential minor ZGA 
genes activated at the two-to-four-cell stages (Extended Data Fig. 6c), 
and this is far fewer than the number of accessible ATAC-seq peaks 
we detected. However, it is worth noting that we cannot rule out the 
possibilities that low-input RNA-seq did not fully capture pre-ZGA 
transcriptional activities. We then asked whether two-cell accessible 
chromatin may poise genes for activation. Indeed, among all two-cell 
accessible promoters (maternal genes excluded) (n = 1,891), 86% are 
shared by the eight-cell embryos (Fig. 2c). These promoters are prefer- 
entially CpG-rich and transcribed at the eight-cell stage (Fig. 2a, right, 
and Fig. 2c). Conversely, eight-cell ZGA genes preferentially have 
accessible promoters at the two-cell stage (44% versus 29% for random 
genes). Notably, this is not observed for two-cell-specific open promoters 
(Fig. 2c and Extended Data Fig. 6d). An expanded analysis revealed 
that these promoters tend to reside in DNA hypomethylated regions in 
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Fig. 2 | Transcription and promoter regulation in human early 
development. a, The UCSC browser view for genes showing transcripts 
but without accessible promoters (left), or vice versa (right) at the two- 
cell stage in humans. Total RNA-seq data were previously published”®. 

b, Heat maps showing the expression levels (mRNA-seq) of differentially 
expressed genes between MII oocytes and two-cell embryos (left). Their 
expression level in oocyte and early embryos detected by total RNA-seq”” 
are also shown (right). Cleavage stage denotes pooled two-to-eight-cell 


oocytes and blastocyst (despite its global hypomethylation)** (Extended 
Data Fig. 6e) (discussed later). Finally, by performing ATAC-seq on 
3PN one-cell and four-cell embryos, we found that accessible chromatin 
at both the one- and four-cell stages highly resembles that of the 
two-cell stage (Fig. 2c and Extended Data Fig. 7a—-c). Hence, pre-ZGA 
accessible chromatin is established in zygote and transits to a post- 
ZGA state at the eight-cell stage. In summary, these results suggest 
that some, but not all, pre-ZGA accessible promoters may poise genes 
for activation. 

Interestingly, two-cell embryos have abundant distal ATAC-seq 
peaks (n = 14,770) with low CpG densities (Fig. 3a and Extended Data 
Fig. 8a), raising the question of whether they reflect the binding of tran- 
scription factors before ZGA’. In total 32% of them are shared by the 
eight-cell embryos and they are indeed enriched for motifs of maternal 
transcription factors such as CTCE, KLF and OTX2 (Extended Data 
Fig. 8b). The rest only occur at the two-cell stage and seem unlikely 
to poise chromatin for local gene activation (Extended Data Figs. 5b 
and 8b). These loci still enrich for enhancer marks (H3K4mel and 
H3K27ac) in human ES cells and transcription factor-binding sites”, 
albeit at lower levels (Fig. 3a,b and Extended Data Fig. 8c). These peaks 
tend to reside in DNA hypomethylated regions in human oocyte and 
blastocyst”4 (Fig. 3a, c). Similar patterns are found for the one- and 
four-cell stages, but are not for ATAC-seq peaks that are specific for 
other stages or methylomes of sperm or human ES cells. In human 
oocytes, untranscribed regions are poorly methylated”, forming par- 
tially methylated domains (PMDs). We found that 73% of two-cell 
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embryos. c, Heat maps showing ATAC-seq enrichment around accessible 
promoters identified at both the two- and eight-cell stages (n = 1,444), 
only at the two-cell stage (nm =375) or only at the eight-cell stage (n = 82) 
(left). Gene expression levels and promoter CpG density (number of 
CpGs per base pair) are also shown (right). Maternal genes (FPKM > 1 
in either MII or germinal vesicle (GV) oocytes) were excluded to avoid 
confounding effects. TSS, transcription start site. 


peaks are present in human oocyte PMDs (Extended Data Fig. 8d). 
Next, we asked whether the switch of chromatin accessibility during 
ZGA depends on transcription. We treated 3PN human zygotes with 
a-amanitin (with derived embryos term as transcription-blocked 
embryos, or TBEs). Both TBEs and controls developed normally to 
eight-cell embryos, which were collected at the late eight-cell stage for 
ATAC-seq analyses. We found that ATAC-seq enrichment in TBEs 
is highly similar to pre-ZGA embryos but is distinct from post-ZGA 
embryos (Fig. 3a, d). Distal open chromatin of TBEs, but not controls, 
is strongly enriched in oocyte PMDs (Extended Data Fig. 8e, f). Thus, 
the transition of accessible chromatin landscapes during human ZGA 
is transcription dependent. 

We and others previously showed that in the mouse, oocyte PMDs 
are uniquely marked by non-canonical forms of histone modifica- 
tions H3K4me3 and H3K27me3 (but preferentially in non-overlapping 
sub-regions)**?°, Depletion of H3K4me3 results in silencing defects 
in mature oocytes**. Non-canonical H3K4me3 (ncH3K4me3) is 
further inherited to mouse early embryos after fertilization, before 
being rapidly erased after ZGA possibly by the activated demeth- 
ylases KDM5A and KDM5B*. After ZGA, human KDMS5B is 
similarly induced (Extended Data Fig. 9a) and H3K4me3 is decreased 
based on immunofluorescence analysis’. Therefore, we asked 
whether pre-ZGA accessible chromatin may be associated with histone 
modifications such as ncH3K4me3. As it is challenging to perform 
chromatin immunoprecipitation followed by sequencing (ChIP-seq) 
in human embryos, we compared histone modifications in the mouse 
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Fig. 3 | Distal accessible chromatin in human early development. a, The 
UCSC browser view showing DNA methylation levels (methylcytosine, 

or mC) in human sperm, oocyte”, blastocyst”4, ATAC-seq enrichment 

in human early embryos, TBEs and ES cells, and H3K4mel in human 

ES cells (ENCODE). Ctrl, control. b, The enrichment of ENCODE 
transcription factor-binding site (TFBS) density around two-cell-specific 
or two-to-eight-cell shared distal peaks. A random set of peaks that match 
the lengths of individual two-cell-specific peaks was used as a control. 
The upstream and downstream regions are 3 x peak lengths away from a 
peak boundary. c, Heat maps showing the oocyte, blastocyst, sperm and 
ES cell (ENCODE) DNA methylation levels around stage-specific distal 
ATAC-seq peaks (zygote n = 889, two-cell n = 622, four-cell n = 930, 
eight-cell n = 3,382, ICM n=1,777, ES cell n= 3,482). The upstream and 
downstream regions are 2 x peak lengths away from a peak boundary. 

d, Hierarchical clustering results of whole-genome accessible chromatin 
states for human early embryos and TBEs. 


zygote? to chromatin accessibility detected by DNase-seq in isolated 
mouse pronuclei’®. Notably, distal open chromatin in maternal pronu- 
clei, but not paternal pronuclei, strongly overlaps with oocyte PMDs 
and maternal ncH3K4me3 in mouse zygotes (Fig. 4a and Extended 
Data Fig. 9b). Both open chromatin and ncH3K4me3 are absent from 
PMDs after ZGA. Similar correlation could be extended to the paternal 
allele. We found the paternal H3K4me3, which is re-established 
after fertilization and briefly exists before ZGA%, also co-localizes 
with paternal open chromatin (Extended Data Fig. 9c, d). Finally, 
we found that in full-grown oocytes, DNase-seq peaks also preferen- 
tially reside in oocyte PMDs and overlap with ncH3K4me3 (Fig. 4a). 
Thus, these data demonstrate a surprising correlation between non- 
canonical H3K4me3 and open chromatin in mouse mature oocytes and 
pre-ZGA embryos. 

To determine whether the accessible chromatin transition during 
mouse ZGA is also transcription-dependent, we treated pronu- 
clear stage 3 (PN3) zygotes with a-amanitin, which arrested mouse 
embryos at the late two-cell stage. We collected TBEs for ATAC-seq 
analyses when the control group developed to a later stage (eight-cells, 
after 45h). On the basis of single nucleotide polymorphisms (SNPs) 
between the parental strains, we found the ATAC-seq enrichment in 
TBEs is similar to that of zygote DNase-seq in an allele-specific manner 
(Extended Data Fig. 10a—c). Distal open chromatin of TBEs on the 
maternal allele persisted in oocyte PMDs in which ncH3K4me3 is also 
retained after inhibition of transcription? (Fig. 4a). The correlation 
between accessible chromatin and H3K4me3 again extends to the pater- 
nal genome in TBEs (Extended Data Fig. 10a, d). Thus, the transition of 
both accessible chromatin and H3K4me3 during mouse ZGA depends 
on transcription. Interestingly, despite allele-specific accessibility, we 
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Fig. 4 | Conservation of chromatin transition during ZGA in mouse and 
human. a, Heat maps showing allelic open chromatin (DNase-seq (oocyte 
and zygote)’ or ATAC-seq (other samples)), H3K4me3* and H3K27me3”° 
enrichment around the oocyte PMDs (n = 24,241) in mouse oocyte, 
zygote, 2-cell embryo, TBE and its control. The upstream and downstream 
regions are 1 x PMD length away from the PMD boundary. b, A model 
shows the transcription and chromatin states before and after ZGA in 
human and mouse. In pre-ZGA embryos or TBEs, accessible chromatin 
occurs at CpG-rich promoters and distal regions preferentially in oocyte 
PMDs. After ZGA, although high-CpG accessible promoters remain 

open, a large fraction of pre-ZGA distal accessible regions are lost. Newly 
established accessible chromatin mainly occurs at putative regulatory 
elements such as enhancers. In mouse, dynamics of distal accessible 
chromatin also correlates with H3K4me3 in an allele-specific manner 

(not shown) before and after ZGA. Histone modification states in human 
are unknown. 


identified very similar motifs in distal accessible chromatin on both 
alleles in zygotes and TBEs, mainly for maternal factors (CTCF and 
KLFs, such as KLF7, KLF11 and KLF17) (Extended Data Fig. 10e). As 
a control, transcription factors activated after ZGA (RARG, NR5A2, 
ESRRB, TEAD4 and GATA)'8 are not enriched, suggesting that similar 
maternal transcription factors interact with allele-specific chromatin 
landscapes before ZGA. 

Using highly sensitive ATAC-seq, our data revealed both conserved 
and species-specific transcriptional regulatory networks in human and 
mouse early embryos. An unusual pre-ZGA chromatin state exists in 
both human and mouse, featured by extensive accessible chromatin 
despite the lack of transcription (Fig. 4b). Although accessible promoters 
preferentially enrich for CpGs, distal accessible chromatin strongly 
overlaps with oocyte PMDs and, in the case of mouse, ncH3K4me3. 
Transcription is crucial for the transition of H3K4me3 and chromatin 
accessibility upon ZGA. One critical question is whether pre-ZGA epig- 
enomes in turn regulate transcription or the genome silencing. The 
accessible regions shared between pre- and post-ZGA stages may have 
a role in poised activation. By contrast, repressive marks can prevent 
faulty transcription during drastic epigenetic reprogramming. For 
instance, oocyte-deposited H3K27me3 is suggested to restrict enhancer 
function in fly early embryos” and regulate X-chromosome inacti- 
vation and allelic expression in mouse embryos”*”*”. Interestingly, 
maternal H3K4me3 also facilitates genome silencing in oocytes*>. It 
is possible that ncH3K4me3 may work as sponges to facilitate docking 
or sequestering transcription resources when the genome is silenced. 
Alternatively, accessible chromatin may be created by unknown factors 
as ‘chromatin harbours’ that allow the deposition of ncH3K4me3 and 
docking of transcription factors. Future studies are warranted to test 
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these possibilities. Taken together, our study not only presents a global 
view for dynamic chromatin landscape in human embryos, but also 
revealed epigenomic transition during ZGA that are likely to be con- 
served between human and mouse. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 


No statistical methods were used to predetermine sample size, and investigators 
were not blinded to allocation during experiments and outcome assessment. 
Ethics statement. This study was approved by the Institutional Review Board (IRB) 
of The First Affiliated Hospital of Zhengzhou University (2015KY-NO.31) and 
Tsinghua University (20170009), China. In accordance with the measures of the 
People’s Republic of China on the administration of Human Assisted Reproductive 
Technology, the ethical principles of the Human Assisted Reproductive Technology 
and the Human Sperm Bank as well as the Helsinki declaration. The research 
followed the guiding principles of the Human Embryonic Stem Cell Ethics 
issued by the MOST and MOH and was regularly reviewed by the Medical Ethics 
Committee of The First Affiliated Hospital of Zhengzhou University. All gametes 
and embryos were collected voluntarily after obtaining written informed consent 
signed by donor couples at the Center for Reproductive Medicine in The First 
Affiliated Hospital of Zhengzhou University. Oocytes were obtained from the 
female tubal factors of infertility volunteers who have already had one or two 
healthy children from natural pregnancy or IVE. The eight-cell and blastocyst 
stages embryos were donated by couples who already had a healthy baby through 
IVE, with written informed consents. In the process, couples were informed that 
their gametes and embryos would be used to study the chromatin state and its 
regulation during human embryo development, and that their donation would 
not affect their IVF cycle. 

Human gamete and early embryo collection. All gametes and embryos were 
collected from volunteers who were between 25 and 30 years old, with normal 
chromosome karyotypes and no family heredity case history, contagion case 
and smoke history. Healthy sperm were cryopreserved in liquid nitrogen using 
cryoprotectants. SpermGrad (Vitrolife) was used for gradient sperm separation. 
For oocytes, the controlled ovarian stimulation were carried out using GnRH 
analogues combined with human menopausal gonadotrophins or recombinant 
follicle stimulating hormone (FSH) for pituitary desensitization. Transvaginal 
ultrasound-guided oocyte collection was scheduled 36h after hCG administration. 
Intra cytoplasmic sperm injection (ICSI) was used in this investigation rather than 
classical IVF to prevent contamination of sperm and cumulus cells. Fertilization 
was assessed 17-20h after ICSI and embryo cleavage was recorded every 24h. 
Two-cell stages embryos were frozen for storage and sample pooling, before being 
thawed and subjected to ATAC-seq and RNA-seq. All two-cell and eight-cell 
embryos and blastocysts used in this research have normal development rates and 
morphology. The zona pellucida of all embryos used in this study was mechanically 
removed to avoid blocking the reaction of Tn5 and the potential contamination of 
cumulus cells. The ICMs of day 5 blastocysts were isolated mechanically according 
to a method previously reported*". In brief, the biopsy pipette was inserted into 
the blastocoel, and the ICM cells were sucked in and gently pulled out with laser 
assistance. Numbers of embryos used for this study (2PN) are: two-cell rep1: 15; 
two-cell rep2: 17; eight-cell rep1: 8; eight-cell rep2: 10; ICM rep1: 3; ICM rep2: 3. 
For the 3PN embryos, the 3PN zygotes were identified after ICSI or IVE, which 
were collected at the one-cell stage or further cultured to the two-, four- or eight- 
cell stage and blastocyst using G-1 (Vitrolife) human embryos culture medium. 
G-2 (Vitrolife) medium was used from the eight-cell to blastocyst stage as normal 
2PN embryos. Only the high-quality 3PN embryos and blastocysts without devel- 
opment arrest were collected. 

Treatment of human embryos with a-amanitin. The 3PN zygotes were identified 
and collected after IVF or ICSI. About 50 3PN zygotes were thawed and transferred 
to G1 media in the presence or absence of «-amanitin (25 ng’, Sigma-Aldrich). 
After the control and «-amanitin-treated embryos reaching the eight-cell stage, 
high-quality embryos were collected and those with fragments or arrested at other 
stages were discarded. The zona pellucida was gently removed by laser. The cells 
were then washed three times with PBS to avoid any potential contamination then 
further subjected to ATAC-seq library preparation. 

Mouse embryo collection. All animal experiments were approved by Institutional 
Animal Care and Use Committee (IACUC) of Tsinghua University, Beijing, China. 
Pre-implantation embryos were collected from 5-6-week-old C57BL/6N females 
(Vital River) mated with PWK/Ph] males (Jackson Laboratory). The embryo 
collection was performed as described previously!®. To inhibit transcription in 
early embryos, PN3 zygotes were cultured in CZB supplemented in the absence or 
presence of a-amanitin (125 }.gml-!) for about 45h. The resulting embryos were 
subjected to ATAC-seq analyses. 

Cell culture of human ES cells. For primed human ES cell culturing, ES cells 
(H1) were maintained on Matrigel (BD Biosciences)-coated plates (Corning) in 
E8 medium (STEMCELL Technologies) in a feeder-free and serum-free manner. 
Naive-state human ES cells were generated as described previously with some 
modifications!*"“, In brief, for t2iL/G6-human ES cell derivation", the H7 human 
ES cells were seeded on matrigel (BD Biosciences) coated plate 48h before virus 
infection. Subsequently, cell culture medium was changed to fresh human ES cell 
culture medium supplemented with 3 virus concentrates containing FUW-human 
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NANOG, FUW-human KLF2 and FUW-Ma2rtTA in the presence of 8 1g ml"! 
polybrene for 24h. Two days later the human ES cells were passaged using Accutase 
(Thermo Fisher) and seeded on mouse embryonic fibroblast (MEF) feeder layers 
in human ES cell culture medium with 101M Y27632. After 24h, medium was 
changed to t2iL/G6 medium for further culture. After 7-14 days of culture, colonies 
with naive mouse ES cell-like morphology appeared and were picked to derive 
the t2iL/G6-human ES cells with t2iL/G6 medium. Note that the t2iL/G6-human 
ES cell proliferation rate at the initiation stage of the conversion was very low, 
and initial passage of t2iL/G6-human ES cells were pursued every 6-7 days using 
Accutase. The 5iL/A-human ES cell derivation! was similar to the t2iL/G6-human 
ES cell derivation, but after virus transfection, the transfected ES cells were directly 
cultured in 5iL/A medium supplemented with 0.5% KOSR, 8 ng ml~! FGF2 and 
2g ml! DOX. After 2 weeks of culture, colonies with naive mouse ES cell-like 
morphology could be isolated and expanded in 5iL/A medium without KOSR, 
FGF2 and DOX. All cell culture was performed under conditions of 5% oxygen 
at 37°C. 

The conventional H7 human ES cells were cultured on mitomycin C-inactivated 
MEF feeder layers in human ES cell culture medium containing 20% knockout 
serum replacement (KOSR) (Thermo Fisher), 78% DMEM/F12 medium (Thermo 
Fisher), 1% NEAA (Thermo Fisher), 2mM GlutaMax (Thermo Fisher), 0.1 mM 
8-mercaptoethanol (Thermo Fisher), and 8 ng ml~! FGF2 (R&D). The t2iL/Gé 
medium contains DMEM/DF12 and Neurobasal medium mixed at 1:1 ratio, 
0.5 x N2 supplement, 0.5 x B27 supplement, 2mM GlutaMax, 1% NEAA, 0.1 mM 
8-mercaptoethanol, 501g ml! BSA, 101g ml insulin, 144M PD0325901, 11M 
CHIR99021, 50,.g ml“! t-ascorbic acid (Sigma-Aldrich), 10ng ml“! human 
LIF, 2.511M G66983 (Sigma-Aldrich) and 10\1M Y27632 (Selleck). The 5iL/A 
medium contains DMEM/DF12 and Neurobasal medium mixed at 1:1 ratio, 
0.5 x N2 supplement, 0.5 x B27 supplement, 1 mM GlutaMax, 1% NEAA, 0.1 mM 
8-mercaptoethanol, 501g ml~! BSA, 14M PD0325901, 20ng ml! human LIF, 
1M IM-12 (Enzo), 0.51.M SB590885 (R&D), 14M WH-4-023 (A Chemtek), 
104M Y27632 and 20ng ml activin A (Peprotech). 

Plasmid cloning and viral preparation. For construction of lentiviral vectors 
containing the human NANOG and KLF2 genes, the homan NANOG and KLF2 
coding sequences were cloned from human ES and fibroblast cell cDNA libraries, 
and inserted after the TetO/CMV promoter of the FUW lentivirus vector for DOX- 
inducible expression, respectively. Lentiviral particles containing FUW-human 
NANOG, FUW-human KLF2 and FUW-MartTA (Addgene ID: 20342) were pack- 
aged in 293FT cells with a VSVG coat. Viral supernatants were collected at 48h 
after transfection, and then were filtered using a 0.45-j1m filter and concentrated 
by high-speed centrifugation. 

miniATAC-seq library generation and sequencing. The miniATAC-seq librar- 
ies were prepared as previously described with further modifications primarily 
for DNA purification”. In brief, samples were transferred into 61 lysis buffer 
(10 mM Tris-HCl (pH 7.4), 10 mM NaCl, 3mM MgCl, and 0.5% NP-40) for 10 min 
on ice. After lysis, 441 ddH,O, 441 5 x TTBL, 5 pl TTE mix V5 (TD502, Vazyme) 
were added to the sample, which was mixed and incubated at 37°C for 30 min. 
The reaction was terminated by incubating with 511 5 x TS stop buffer (TD502, 
Vazyme) at the room temperature for 5 min, before 2 il carrier RNA (20ngyl! 
after 50 x dilution) (EpiTect Fast DNA Bisulphite Kit 59824) and 103 il Tris-EDTA 
(TE) buffer was added to the sample. The sample was then transferred to a 1.5-ml 
tube. After adding 130 11 phenol-chloroform (vortexed and incubated at room 
temperature for 3 min), the sample was transferred to a phase-lock tube (WM5- 
2302820 TIANGEN). After spinning at maximum speed for 15 min, the superna- 
tant was transferred to a new 1.5-ml tube, and 6501 ethanol, 2411 sodium acetate 
(3M) and 211 glycogen were added for DNA precipitation at —20°C overnight. 
The next day, DNA pellet was spun down at maximum speed for 15 min at 4°C, 
washed with 75% ethanol, air dried, and resuspended in 29 1] H2O. The phenol- 
chloroform purification with carrier RNA improves the efficiency for both DNA 
purification and removal of residual Tn5 transposase, which may otherwise inter- 
fere with subsequent reactions. DNA was transferred to a 0.2-ml PCR tube, and 
5 ul NSXX primer, 5 jul N7XX primer (TD202, Vazyme), 1015 x TAB and 1 ul TAE 
(TD502, Vazyme) were added. PCR was performed to amplify the library for 18 
cycles for about 30 embryos or ES cells using the following PCR conditions: 72°C 
for 3 min; 98°C for 30s; and thermocycling at 98°C for 15s, 60°C for 30s and 72°C 
for 3 min; following by 72°C 5 min. After the PCR reaction, libraries were purified 
with the 1.5 x AMPure (Beckman) beads and were subjected to next-generation 
sequencing. 

RNA-seq library generation and sequencing. The Smart-seq2 libraries of 
human oocytes and early embryos were prepared using SMART-Seq v4 Ultra 
Low Input RNA Kit for Sequencing (Clontech, 634888) also as previously 
described**. Oocytes and early embryos from the same batch were divided into 
two parts, with 95% cells used for ATAC-seq and the rest for RNA-seq. Oocytes 
and early embryos (two-cell, four-cell and eight-cell stage, and ICMs) were 
lysed in lysis buffer containing RNase inhibitor according to the user manual. 
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The library was quantified using Qubit and Agilent 2100 before being subjected 
to sequencing. 
ATAC-seq data processing. The single-end ATAC-seq reads were aligned to 
mm9 or hg19 reference genome with random chromosome cleaned by Bowtie 
(version 2.2.2) under the parameters -t -q -N 1 -L 25. The paired-end ATAC-seq 
reads were aligned with the parameters: -t -q -N 1 -L 25 -X 2000 -no-mixed -no- 
discordant. All unmapped reads, non-uniquely mapped reads and PCR duplicates 
were removed. For downstream analysis, we normalized the read counts by com- 
puting the numbers of reads per kilobase of bin per million of reads sequenced 
(RPKM). RPKM values were averaged for each bin between replicates. To minimize 
the batch and cell type variation, the RPKM values were further normalized by 
Z-score transformation. To visualize the ATAC-seq signal in the UCSC genome 
browser, we extended each read by 250 base pairs (bp) and counted the cover- 
age for each base. The correlation between ATAC-seq replicates was calculated 
as following: each read was extended 250 bp from the mapped end position and 
the RPKM value was generated on a 100 bp-window base. The ATAC-seq enrich- 
ment was then summed within each 2-kb window for the entire genome and was 
compared between replicates. Pearson correlation was calculated and was shown. 
Allele assignment of sequencing reads for mouse TBE embryos was conducted 
as described previously"®. In brief, to assign each read to its parental origins, we 
examined all SNPs in the read that showed high-quality base calling (Phred score 
> 30). For paired-end reads, SNP information from both reads in the pair was 
summed and used. When multiple SNPs were present in a read (or a read pair), the 
parental origin was determined by votes from all SNPs and the read was assigned 
to the allele that had at least two thirds of the total votes. 
Gene expression data processing. All RNA-seq data were mapped to hg19 genome 
by Tophat (version 2.4.0)*4. The gene expression level was calculated by Cufflinks 
(version 2.2.1)*° based on the hg19 refFlat annotation database from the UCSC 
genome browser. The Spearman correlation coefficients between RNA-seq repli- 
cates were calculated. For published total RNA-seq data’, the expression values 
were quantile normalized together with the mRNA-seq data in this study to allow 
further comparison. For naive human ES cell gene expression microarray data’, 
the expression values were downloaded and averaged for each gene. The averaged 
values were quantile normalized together with the FPKM value in this study to 
allow further comparison. 
DNase-seq, ChIP-seq and DNA methylation data processing. Downloaded 
DNase-seq and ChIP-seq data were mapped to mm9 genome by Bowtie with 
similar parameters as ATAC-seq data. All unmapped reads, non-uniquely mapped 
reads and PCR duplicates were removed. For downstream analysis, we calculated 
the read counts by computing RPKM on genome 100-bp bin. Downloaded DNA 
methylation data were mapped to mm9 or hg19 genome by BSMAP* with param- 
eters: -r 0 -p 16 -w 100 -v 0.1. PCR duplicates were removed. For each CpG site, the 
methylation level is calculated as the total methylated counts (combining Watson 
and Crick strands) divided by the total counts across all reads covering this CpG. 
Identification of stage-specific genes. The minor ZGA genes activated at the 
two-to-four-cell stages were identified by selecting those expressed at the two-cell 
stage (FPKM > 5) but not expressed or expressed at low levels in MII and germinal 
vesicle oocytes (FPKM <5) in both mRNA-seq and total RNA-seq, with a minimal 
twofold gene expression change between the two- or four-cell and oocyte stages. 
For stage-specific genes in eight-cell embryos, ICMs and human ES cells, a 
Shannon entropy-based method was used to identify stage-specific genes, as pre- 
viously described?’”. Owing to the possible confounding effects from maternally 
inherited RNA transcripts, ZGA-only genes were analysed, which were defined 
as those not expressed in MII or germinal vesicle oocytes (FPKM < 1). Genes 
with entropy scores of less than 2 were selected as candidates for stage-specific 
genes. Among these genes, we selected candidates of stage-specific genes for each 
stage based on the following criteria: the gene is highly expressed at this stage 
(FPKM > 10), and such high expression cannot be observed in other stages. These 
genes were then reported in the final stage-specific gene lists. 
Identification of ATAC-seq peaks and their genome coverages. All the ATAC- 
seq peaks were called by MACS v1.4** with the parameters-nolambda —nomodel. 
ATAC-seq peaks that are at least 2.5 kb away from annotated promoters from ref- 
Flat were selected as distal ATAC-seq peaks. The genome coverages of peaks from 
different samples were calculated by genomeCoverageBed® using hg19 reference 
genome. 
The comparison between ATAC-seq peaks and known cis-regulatory elements. 
To compare the ATAC-seq peaks identified in early embryos with the annotated 
cis-regulatory elements, we calculated the overlap between the ATAC-seq peaks 
of different stages and annotated promoters (TSS + 0.5 kb). Distal peaks were then 
compared to distal DNase I hypersensitive sites in ES cells. Random peaks were 
generated by selecting random regions in the genome with the sizes matching each 
individual ATAC-seq peak. 
The comparison between ATAC-seq peaks and repetitive elements. To iden- 
tify the overlap between repetitive elements and promoter or distal ATAC-seq 


peaks, the ATAC-seq peaks were compared with the locations of annotated repeats 
(RepeatMasker) downloaded from the UCSC genome browser by intersectBed*? 
with default parameters. As repeats of different classes vary greatly in numbers, 
a random set of peaks with identical lengths of ATAC-seq peaks was used for 
the same analysis as a control. The numbers of observed peaks that overlap with 
repeats were compared to the numbers of random peaks that overlap with repeats, 
and a log ratio value (log») was generated as the ‘observed/expected’ enrichment. 
Identification of stage-specific distal ATAC-seq peaks. The two-cell-specific 
distal ATAC-seq peaks were identified as those with high levels of ATAC-seq 
enrichment at the two-cell stage (normalized RPKM > 1) but not at the eight-cell 
stage (normalized RPKM <0). Those with normalized RPKM > 1 at both stages 
were identified as shared distal peaks between the two- and eight-cell stages. 

To identify stage-specific distal ATAC-seq peaks, the distal ATAC-seq peaks 
from human embryos of all stages and ES cells were combined, with overlapped 
peaks merged. The average RPKM values were calculated for these distal ATAC-seq 
peaks which were further normalized by the Z-score normalization. A Shannon 
entropy-based method” was used to identify stage-specific distal ATAC-seq peaks. 
We selected those with entropy less than 2 as candidates for stage-specific distal 
ATAC-seq peaks. The stage-specific distal ATAC-seq peaks were further defined 
based on the following criteria: the distal ATAC-seq peak has high enrichment 
at this stage (normalized RPKM > 1), and positive enrichment (normalized 
RPKM > 0) at no more than two additional stages. The resulting distal ATAC 
peaks were then reported in the final stage-specific distal ATAC-seq peak list. 
The functional enrichment for genes that are near stage specific distal ATAC-seq 
peaks was analysed using the GREAT tool” by default settings. 

Prediction of promoter targets of putative enhancers in distal ATAC-seq peaks. 
To identify the potential targeted genes for stage specific enhancer (distal peaks), 
we computed the averaged ATAC-seq enrichment (normalized RPKM) for all distal 
ATAC-seq peaks and annotated promoters (TSS + 0.5 kb). Among genes assigned 
to enhancers by GREAT analysis based on distances, we further calculated the 
correlation between the ATAC-seq enrichment at distal ATAC peaks and these 
promoters across human embryos and ES cells. The promoter with a Pearson 
correlation coefficient above 0.8 was selected as the potential target of the enhancer. 
Gene ontology analysis. The DAVID web-tool was used to identify the Gene 
Ontology (GO) terms using databases including Molecular Functions, Biological 
Functions and Cellular Components*!. 

Hierarchical clustering analysis. The hierarchical clustering was performed in 
R by hclust() function with ATAC-seq RPKM values via Spearman correlation 
coefficients. 

Motif, enhancer mark and transcription factor-binding sequence analyses for 
distal ATAC-seq peaks. To find the sequence motifs enriched in distal ATAC- 
seq peaks, findMotifsGenome.pl from the HOMER program!’ was used. Motifs 
with known match in HOMER database were selected. The ChIP-seq data for 
human ES cell H3K27ac and H3K4mel marks” and the collection of transcription 
factor-binding sites*> were downloaded from the UCSC genome browser. The 
average RPKM values at the distal peaks and their nearby regions were calculated. 
The number of transcription factor-binding sites was first binned for each 100-bp 
window, and the average enrichment at the distal peaks and their nearby regions 
was calculated. 

Identification of oocyte PMDs. PMDs in oocytes were identified as described 
previously’. In brief, published human”! and mouse“? oocyte DNA methylation 
datasets were downloaded. Average oocyte DNA methylation levels were calculated 
for 10-kb bins of the genome. Bins with average DNA methylation levels of lower 
than 0.5 were selected and merged into PMDs. Promoter regions (+2.5 kb) were 
excluded from PMDs. 

The ATAC-seq enrichment in PMD. ATAC-seq RPKM is Z-score normalized 
on the genome-wide 100-bp bin with promoter regions removed. PMDs, as well 
as their upstream and downstream regions, were divided into five bins separately, 
and the mean RPKM was calculated for each bin and shown in the heat map. The 
average ATAC-seq RPKM for all PMDs, as well as their upstream and downstream 
regions, was computed and shown. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. The data sets generated and analysed during the current study 
are available in the Gene Expression Omnibus (GEO) with the accession number 
GSE101571. 
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Extended Data Fig. 1 | Validation of miniATAC-seq. a, UCSC browser 
view showing the ATAC-seq signals using various numbers of mouse ES 
cells. b, Scatter plots comparing the miniATAC-seq signals from various 
numbers of mouse ES cells with conventional ATAC-seq using 50,000 


mouse ES cells. The Pearson correlation coefficients are also shown. c, Box 
plots showing the ATAC-seq enrichment for peaks from 50,000 mouse ES 
cells that are recaptured or missed by miniATAC-seq using lower numbers 
of mouse ES cells. 
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Extended Data Fig. 2 | Validation of RNA-seq data in human early 
embryos. a, Microscopy images of human 2PN and 3PN embryos at the 
zygote, two-cell, four-cell, eight-cell and blastocyst stages. ICSI was used 
to avoid the cumulus cell contamination. High-quality score embryos were 
selected for subsequent study (represented by magnified images). b, Heat 
maps showing gene expression levels for differentially expressed genes 
between human embryos of pre-lineage segregation and post-lineage 


specification“. Expression of ICMs in this study is also shown for two 
replicates. Epi, epiblast, PrE, primitive endoderm. c, Bar charts showing 
the Spearman correlation between the two replicates of RNA-seq 
samples. d, Hierarchical clustering of RNA-seq datasets from this study 
(using Smart-seq) and previous studies**”? (using a different mRNA-seq 
method»). Pearson correlation was used to measure distances. Different 
colours represent various stages. 
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Extended Data Fig. 3 | Validation of ATAC-seq data in human early for human ES cells (ENCODE). d, The overlap between ATAC-seq peaks 
embryos. a, Scatter plots showing the ATAC-seq signals between replicates and annotated promoters (TSS + 0.5 kb) or distal DNase I hypersensitive 
at each stage in human early development or between 2PN and 3PN sites in ES cells. A random set of peaks that match the lengths of individual 
embryos. b, UCSC browser view showing the landscape of accessible ATAC-seq peaks were used as a control. e, UCSC browser views showing 
chromatin in replicates of human early embryos. c, Bar charts showing the — the ATAC-seq and RNA-seq enrichment near a representative gene. Open 
genome coverages of ATAC-seq peaks of each stage or DNase-seq peaks chromatin regions are shaded. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a c 
H3K27me3 H3K27me3 
Promoter 
accessibility hESC IMR90 
p<8.7x10> p<t.4x10 > 
High 1.0 
(n=116) , a 3 T 
= ' = ! 
< : x 9 
2 05 ' ra 1 
3 5 81 + 
Dynamic Ss 00 == 2 
(n=183) eu 1 - ¢ 
: Giese?) 
— 
-0.5 
Promoter x ROY RS 
Low sais, RNS) s VW es 
(n=183) accessibility of of 
-0.7 53 <4 0 1 0 0.2 
FPKM(logz) Normalized RPKM — CpG density 
b Genes with promoters Genes with promoters 
of high accessibility of dynamic accessibility 
Nucleosome assembly Multicellular organism development 
Transmembrane transport Outflow tract morphogenesis 
Cellular response to heparin Regulation of stem cell division 
Protein phosphorylation Positive regulation of 
osteoblast differentiation 
0 1 2 3 4 0 4 2 3 4 
-log10 P-value 
-log10 P-value g 
d 
All Low CpG Medium CpG High CpG 
2-cell . : } : | 
TIE 0.02 > R=0.11 7 R=0.06 
o 2 4 6 8 10 o 2 4 6 8 10 0 2 4 6 8 10 
2 : > ee 
pene 8-cell is | i . 
— = R=0.11 ~af R=0.33 i R=0.41 
o + ° eo E oL 
ab) = o 2 4 6 8 10 o 2 4 6 8 10 o 2 4 6 8 10 
henna 
g Pe % ® ® ® 
& ea be : : . ose 
6 ICM ° : al F 
= . R=0.06 > R=0.27 >. R=0.36 
0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10 
hESC : : : : 
7] i, R=0.13 7 [E R=0.38 R=0.35 


0 2 4 6 8 10 


0 2 4 6 8 10 0 2 4 6 8 10 


RNA-seq log,(FPKM+1) 


Extended Data Fig. 4 | Relationship of chromatin accessibility and 
transcription in human early embryos. a, Heat maps showing three 
classes of promoter accessibility (high, dynamic and low) for stage- 
specific genes (maternal genes excluded). CpG densities and H3K27me3 
levels in human ES cells and fibroblasts (IMR90)* are also shown. b, GO 
analysis results for gene classes in a. (The ‘low’ class does not have gene 


set enrichment with P < 10~*). c, Box plots showing promoter enrichment 
of H3K27me3 in human ES and IMR90 cells*° for each class in a. P values 
based on a one-sided t-test are shown. d, Scatter plots showing promoter 
ATAC-seq enrichment and gene expression (maternal genes excluded) for 
all genes (black) or genes with promoters of low (green), medium (blue), 
or high (red) CpG densities. Spearman correlation coefficients are shown. 
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Extended Data Fig. 5 | Features of distal ATAC-seq peaks in human distal ATAC-seq peaks in human embryos. Left, GREAT analysis”° results 
embryos. a, Top, enrichment of repeats in ATAC-seq promoter and distal are also shown. Right, the expression of predicted targets among GREAT 
peaks compared to that in random peaks for early human embryos, human __ listed nearby genes for putative enhancers (distal peaks) (Methods) are 


ES cells, and somatic cell types. The enrichment was calculated as a log, shown for each stage. c, Bar charts showing the expression of TFAP2C in 
ratio for the numbers of observed peaks that overlap with repeats divided epiblast (EPI), primitive endoderm (PE) and trophectoderm (TE) lineages 
by the numbers for random peaks. A random set of peaks that match the in human embryos from embryonic days 5-7 based on a previous study‘. 
lengths of individual ATAC-seq peaks was used. Bottom, a similar analysis | d, UCSC browser view showing the ATAC-seq signals around POUS5F1 
was performed for the enrichment of repeat subfamilies in distal peaks. in human early embryos and ES cells (primed, naive 1'* and naive 214). 

b, Middle, heat maps showing the ATAC-seq signals for stage-specific Human ES cell DNase-seq data (ENCODE) are shown as a control. 
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Extended Data Fig. 6 | Transcription and chromatin states before major 
human ZGA. a, Box plot showing the expression levels of two-cell specific 
(left) and eight-cell specific (right) genes in MII oocytes and embryos with 
or without «-amanitin treatment. P values based on a one-sided t-test are 
shown. b, The average ATAC-seq enrichment for each stage is shown at the 
promoters of stage specific genes at the same stage identified by mRNA- 
seq. Two-cell ATAC-seq enrichment for a random set of promoters were 
similarly analysed as a control. c, Heat maps showing expression levels of 
possible minor ZGA genes activated at the two-to-four-cell stage for their 


expression in germinal vesicle oocytes, MII oocytes, and two-to-four-cell 
embryos. Promoter ATAC-seq enrichment for two-to-four-cell stages 
and CpG densities are also shown. d, UCSC browser view showing the 
promoter ATAC-seq signals specifically appearing in two-cell embryos. 
e, Left, heat maps showing ATAC-seq enrichment at the accessible 
promoters present in both two- and eight-cell embryos, as well as those 
specific to each stage. Right, the human oocyte, blastocyst, sperm and ES 
cell DNA methylation levels around these promoters are also shown. 
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Extended Data Fig. 9 | Accessible chromatin state in mouse oocytes regions. c, Heat map showing the Spearman correlation between allelic 
and pre-ZGA embryos. a, The expression of human KDM5B and mouse DNase-seq and H3K4me3 signals in zygotes. M, maternal; P, paternal. 
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Mouse ES cell H3K4me3 signals are also shown to mark the promoter 
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Extended Data Fig. 10 | Accessible chromatin state in mouse TBEs. a, UCSC 
browser view showing DNA methylation levels in mouse sperm and oocyte, 
and ATAC-seq signals in normal mouse embryos as well as allelic ATAC-seq 
and H3K4me3 enrichment in TBE samples. Mouse ES cell H3K4me3 signals 
are also shown to mark the promoter regions. b, Hierarchical clustering 

results showing the relationships of allelic accessible chromatin states between 
zygotes, TBEs, and two- and eight-cell embryos in mouse. c, Heat map 
showing open chromatin regions that are unique to zygotes (DNase-seq) or 
45h control embryos (ATAC-seq). The ATAC-seq enrichment in TBE samples 
is then matched and shown. d, Heat map showing the Spearman correlation 
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between allelic ATAC-seq and H3K4me3 signals in TBEs. e, Transcription 
factor motifs identified from distal DNase-seq and allelic distal ATAC-seq 
peaks are shown. Motifs shared by pre-ZGA and post-ZGA stages or are 
specific for post-ZGA stages are noted. For transcription factors that have 
multiple family members with similar motifs (KLF and GATA), the highest 
expression and motif enrichment among all family members at each stage are 
shown. A random set of peaks that match the lengths and number of zygote 
maternal peaks was used as a control. It is worth noting that the RNA levels 
of CTCF appear to decline in TBEs, presumably owing to RNA degradation 
during extended transcription inhibition. 
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Structural principles of distinct assemblies of the 
human c42 nicotinic receptor 


Richard M. Walsh Jr!?°, Soung-Hun Roh?*+5, Anant Gharpure!?, Claudio L. Morales-Perez?, Jinfeng Teng!* & Ryan E. Hibbs!?* 


Fast chemical communication in the nervous system is mediated 
by neurotransmitter-gated ion channels. The prototypical member 
of this class of cell surface receptors is the cation-selective nicotinic 
acetylcholine receptor. As with most ligand-gated ion channels, 
nicotinic receptors assemble as oligomers of subunits, usually as 
hetero-oligomers and often with variable stoichiometries’. This 
intrinsic heterogeneity in protein composition provides fine 
tunability in channel properties, which is essential to brain function, 
but frustrates structural and biophysical characterization. The 0482 
subtype of the nicotinic acetylcholine receptor is the most abundant 
isoform in the human brain and is the principal target in nicotine 
addiction. This pentameric ligand-gated ion channel assembles in 
two stoichiometries of a- and 3-subunits (20:38 and 30:26). Both 
assemblies are functional and have distinct biophysical properties, 
and an imbalance in the ratio of assemblies is linked to both nicotine 
addiction” and congenital epilepsy*. Here we leverage cryo- 
electron microscopy to obtain structures of both receptor assemblies 
from a single sample. Antibody fragments specific to 82 were used 
to ‘break symmetry during particle alignment and to obtain high- 
resolution reconstructions of receptors of both stoichiometries 
in complex with nicotine. The results reveal principles of subunit 
assembly and the structural basis of the distinctive biophysical and 
pharmacological properties of the two different stoichiometries of 
this receptor. 

Pseudo-symmetric heteromeric proteins, exemplified by the Cys- 
loop receptor family, present a challenge in cryo-electron microscopy 
(cryo-EM), because particle alignment is dominated by low-resolution 
information®’, and at low resolution the subunit identities may not be 
readily distinguishable. Heterogeneity in subunit assembly adds to the 
challenges in particle alignment. In the nicotinic receptor, the a- and 
B-subunits have nearly identical secondary structural features, which 
makes them and their multiple assemblies indistinguishable at low 
resolution. We raised monoclonal antibodies against the recombinant 
0482 nicotinic receptor and isolated high-affinity antigen-binding 
(Fab) fragments to structurally distinguish o-subunits from -sub- 
units within the pseudo-symmetric pentamer. In contrast to our earlier 
crystallographic studies, in which we optimized expression conditions 
to produce more-or-less exclusively the 20:38 assembly’, here we inten- 
tionally produced a mixed population of both stoichiometries of the 
0482 receptor (20:38 and 30:28). We co-purified the receptor—Fab 
complex, and prepared sample grids for cryo-EM (Extended Data 
Fig. la and Methods). Fab has little or no effect on [?H]-nicotine bind- 
ing and activation of the channel (Extended Data Fig. 1b-d). 

Single particle cryo-EM data collection and processing revealed two 
distinct receptor populations, one with two Fabs bound and one with 
three Fabs bound, corresponding to the 30:28 and 2:36 stoichio- 
metries, respectively (Extended Data Fig. 2). We obtained cryo-EM 
maps of the two stoichiometries to approximately 3.5 A resolution for 
the receptor and approximately 4 A resolution for the entire complex 
including Fab (Fig. la-d and Extended Data Figs. 2, 3), with map 


quality that was substantially improved over that of the crystallo- 
graphic structure of the 20:38 assembly® (Extended Data Table 1 and 
Extended Data Figs. 4-6). Each receptor assembly is cylinder-shaped, 
with a large extracellular vestibule and funnel-shaped transmembrane 
channel. Fabs interact exclusively with the extracellular domain (ECD) 
of the 82 subunits (Extended Data Fig. 1e-h). The agonist nicotine was 
included throughout purification, and density for it was observed at 
a-$ and a-a interfaces in the receptor ECD. Nicotine stabilizes the 
ion channel in a putative closed-desensitized conformation. 

We sought to understand why we did not observe more than two 
subunit assemblies?”"°, given that our structures demonstrate the 
occurrence ofall possible interfaces between a4 and $2 subunits. Each 
stoichiometry contains two a-B and two B-a interfaces, plus a third 
unique interface type: one a-a (3:28) or one 8-8 (20:3) interface 
(Fig. 2a—d). Our reconstructions reveal that in an agonist bound, desen- 
sitized state, the a—a interface buries the most protein surface area 
from solvent, followed by the a-6 and B-a interfaces (Fig. 2e). The 8-6 
interface has the smallest amount of buried surface area. Calculated 
interfacial free energies predict preferential assembly of a-8, followed 
by B-c as the most favourable, followed by a—a. 6-8 is the least favour- 
able interface. A polar fenestration formed at the 3-8 interface is large 
enough (more than 5 A) for hydrated ions to pass through (Fig. 2d 
and Extended Data Fig. 7) and contributes to the decreased surface 


Cholesterol 


a1 helix MX 


Fig. 1 | Fab-assisted cryo-EM resolves heterogeneity in nicotinic 
receptor subunit assembly. a, b, 20:36-Fab complex reconstruction. 

Fab fragments are light grey, density corresponding to cholesterol is 

gold, and the a4 and 82 subunits are green and blue, respectively, with a 
lighter shade of green or blue denoting the subunit position that is unique 
between the two stoichiometries. c, d, 3a:28-Fab complex reconstruction. 
b, d, Dashed lines indicate approximate membrane position. 
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Calculated free energy 


Interface Interface area (A2) AG (kcal mol) 
2038 (a-B interface) 2,805 -45 
3028 (o-B interface) 2,797 44 
2038 (B-o interface) 2,525 42 
3028 (B-o interface) 2,532 -43 
3028 (o-a. interface) 2,923 -40 
2038 (B-B interface) 2,086 -38 


Fig. 2 | Interface classes. a-d, View from the periphery of the receptor 
towards the pore axis (left) and from the pore axis out towards the 
periphery of the receptor (right) for the four types of interface. a4 and 
82 subunits are coloured as in Fig. 1. e, Calculated interfacial buried 
surface areas and free energies. For interface classes for which there are 
two interfaces per assembly (a-@ and 6-a interfaces), values shown are 
averages. 


area buried at this interface class. These observations suggest that the 
more favourable a-6 and B-a interfaces assemble first, followed by 
incorporation of the fifth subunit to create the unique interface as the 
final step in pentamer formation. Superposition of like subunits, both 
within an assembly and between assemblies, reveals no substantial 
conformational differences among a-subunits. However, there is a 
rigid body twist of 4.4° in the ECDs relative to the transmembrane 
domains (TMDs) of the two B-subunits that compose the unique B-8 
interface compared to all other 32 subunits (Extended Data Fig. 6d-h). 
This conformational heterogeneity in 82 subunit structure is likely to 
add to the instability of the 8-6 interface. These structural observa- 
tions suggest that the 30:28 stoichiometry would be the predominant 
assembly in the absence of other factors, consistent with observations 
in rat motor cortex!! and in expression of recombinant mammalian 
receptors”. In our sample preparation, we exploited nicotine and low 
temperature, two factors that are known to shift the stoichiometry 
towards the 20:38 assembly, during viral transduction to boost overall 
receptor expression’. 

Analysis of interfaces and subunit conformation suggested an order 
of subunit incorporation in the observed 20:38 and 3«:28 assemblies 
driven by free energy, but did not explain why other theoretically possi- 
ble subunit stoichiometries were not observed. To address this question, 
we superimposed the unique interface from each assembly (a-a or 
8-8) upon itself to build up theoretical homopentameric assemblies, 
wherein all interfaces except for the final interface would be equivalent 
to the starting, experimentally observed interface. We then analysed 
this last, non-experimental interface and found in the 5a assembly a 
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marked increase in surface area buried compared to the experimen- 
tal «-a interface (3,849 A? versus 2,923 A’) and 42 atomic clashes 
(Extended Data Fig. 8). Put simply, homopentameric a4 assemblies 
are unfavourable because they pack too tightly. Conversely, in the 53 
assembly, at the final, non-experimental interface, we observed a gap 
between subunits resulting in a large decrease in surface area buried 
compared to the experimental 8-8 interface (1,051 A? versus 2,086 A2). 
Thus, 8-8 interfaces pack such that they create too obtuse an angle to 
effectively close the pentameric ring. We added one a-( interface into 
each of these homopentameric assemblies to examine the final theoret- 
ically possible subunit stoichiometries and observed a small decrease in 
the number of clashes in the 40:18 assembly (39 clashes), and a small 
increase in the surface area buried at the 1:48 assembly (1,301 A?). 
We conclude from these comparisons that only two stoichiometries are 
observed because only one a-a or one B-( interface can be accommo- 
dated in the pentamer while maintaining favourable interactions at all 
interfaces around the pentameric ring. 

The ratio of the two observed assemblies of the «432 receptor in 
vivo is tunable and context-specific and has been linked to genetic 
disorders and addiction?~°. Smoking, as well as nicotine added to 
cultured cells, preferentially upregulates the 2«:38 stoichiometry’. 
Mutations that shift the stoichiometry towards either the 30:26 or 
2:38 assembly result in autosomal-dominant nocturnal frontal lobe 
epilepsy*°. A high-penetrance form of this syndrome is linked to a 
serine-to-leucine mutation in the a4 subunit, at the 10’ position in 
the M2 pore-lining helix (Extended Data Fig. 6c), which results in 
a shift in subunit stoichiometry towards the 3c:28 assembly*. The 
mechanism underlying this shift in assembly has been unclear. In the 
cryo-EM structures, we observe this serine side chain orienting into a 
hydrophobic pocket in the adjacent subunit, regardless of the identity 
of that subunit. Mutation from the polar serine to the hydrophobic leu- 
cine should be energetically favourable, given the local environment. 
This substitution would favour assembly of the 30:28 stoichiometry 
because with three mutated a-subunits, three interfaces would have 
improved hydrophobic interactions and fewer unsatisfied hydrogen 
bonding interactions, contrasted with only two such favourable inter- 
faces in the 20:38 assembly. 

The 30:26 assembly is classically described as the low-sensitivity iso- 
form of the «42 receptor, with half maximal effective concentration 
(ECs9) values for agonist approximately 100-fold higher than for the 
20:36 stoichiometry’. We sought to understand this agonist sensitivity 
difference in the context of receptor structure. The high-sensitivity 
neurotransmitter sites in the 042 receptor are found at the 04-82 sub- 
unit interfaces in ECDs. The a-a interface in the 30:28 assembly may 
serve as a low-sensitivity neurotransmitter site, occupation of which 
is necessary to obtain a full response’!. We included 1 mM nicotine 
in our preparation to saturate high- and low-sensitivity sites. A com- 
parison of the four classes of interface reveals density for nicotine 
at the low-sensitivity a—a interface in the 3a:28 stoichiometry 
in addition to the high-sensitivity a- interfaces in both stoichio- 
metries of the 0462 receptor (Fig. 3a, b and Extended Data Fig. 9). 
All a-6 interfaces are qualitatively equivalent and are consistent with 
the crystal structure® in the binding orientation of nicotine and the 
interactions it makes at this interface class. The binding orientation 
of nicotine at the a-6 neurotransmitter binding sites is distinct from 
that observed at the a-a binding site. In the a-( interfaces, nico- 
tine is oriented with its long axis approximately parallel to the pore 
axis, with its pyridine ring oriented away from the membrane. At the 
a-o interface (Fig. 3a and Extended Data Fig. 9), the pyridine ring 
of nicotine orients towards the pore axis with the molecule tilted off 
the membrane normal. This pose of nicotine in the a—-a binding site 
is consistent with crystal structures of other 0482 agonists bound to 
a soluble extracellular domain engineered to mimic the a—ca binding 
site in a482'4. This trend suggests that the distinct binding orienta- 
tion is a meaningful feature of the low-sensitivity a—a binding site. 
The difference in sensitivity between the classical a-$ neurotrans- 
mitter binding site (high sensitivity) and the unique a-a binding site 
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Fig. 3 | Agonist and pseudo-agonist binding sites. a~d, Molecular details 
of the a-a (a) and a-6 (b) nicotine-binding interfaces and corresponding 
regions at the 8-6 (c) and B-a (d) interfaces. Top row is synaptic 


(low sensitivity) has been attributed to three amino acid positions on 
the complementary (—) face of the binding pocket: V111/F119/L121 
for the 82 (—) subunit and H116/Q124/T126 for the a4 (—) subunit!*5 
(Fig. 3c, d and Extended Data Fig. 9). In the a-( site (Fig. 3b and 
Extended Data Fig. 9), the three hydrophobic residues from the 82 (—) 
subunit provide a surface of van der Waals contacts that orients the 
pyridine ring of nicotine, locking it in the aromatic box in a vertical 
position. In the a-a interface, two of the three variant amino acid 
positions of a4 (—), T126 and Q124, by the nature of being shorter 
and polar amino acids, provide a less compact and more hydrophilic 
site that allows the alternative binding orientation. 

Owing in part to limited high-resolution structural information for 
heteropentameric assemblies, the role of the pseudo-agonist sites, or 
interface sites in the ECD that do not bind agonist, has been relatively 
unexplored. Previous crystallographic work revealed the molecular 
architecture of the pseudo-agonist sites®, and here we extend this 
analysis to all interfaces found in the two stoichiometries of the 0462 
nicotinic acetylcholine receptor (Fig. 3c, d). The uniqueness of these 
pseudo-agonist site interfaces stems from the amino acid sidechains 
presented by 82 when it occupies the principal (+) side of the subunit 
interface. Of particular interest is R149, which would electrostatically 
and sterically impede agonists from binding and is positioned to form 
cation-7 interactions with the sandwiching aromatic residues Y95 
and Y196 from ($2 (+) (Fig. 3c, d). This intramolecular interaction is 
conserved among all pseudo-agonist binding interfaces between the 
two assemblies; furthermore, in the closed, presumably desensitized 
conformations presented here, all the B-a pseudo-agonist sites in both 
assemblies appear to be structurally equivalent. We speculate that the 
guanidium group of R149, which chemically and structurally mimics 
the nicotinic pharmacophore, may act in part as a covalent agonist, 
potentiating activation. 

The two 0482 receptor assemblies have different permeation proper- 
ties: the 3c:28 assembly is about threefold more permeable to Ca”* than 
the 20:3 assembly”® and has a higher single channel conductance’. 
Both permeation pathways are strongly electronegative, as would 
be expected for a cation-selective channel, with the 3:28 assembly 
having a stronger electronegative potential throughout (Fig. 4a, b). 
These differences in electrostatic potential are localized to the extra- 
cellular vestibule and the junction of the ECD and TMD. The stronger 
electronegative potential seen in the extracellular vestibule of the 30:28 
assembly could not be predicted from sequence alone, as the acidic side 
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chains lining the extracellular vestibule that are thought to affect con- 
ductance are conserved between the «4 and 82 subunits!”. The continu- 
ity of the electronegative potential is interrupted in the 20:38 assembly 
at the junction between the ECD and TMD, and results from opposing 
charges at the 20’ position, E268 in 30:28 and K260 in 20:38, in the 
pore-lining M2 a-helices (Fig. 4c). Point mutations at the 20’ position 
have demonstrated that having a negative charge at this extracellular 
entrance to the channel pore is an important determinant of conduct- 
ance in nicotinic receptors!* and calcium permeability in cation- 
selective Cys-loop receptors'®!’. In the «4{2 receptor specifically, 
mutational studies with concatemers!* have shown that increasing 
the proportion of negative charges at this position correlates directly 
with increasing permeability to Ca**. Thus, the difference in charge at 
the 20’ position is a principal determinant of the different permeation 
properties of the two receptor subunit assemblies. A caveat to these 
comparisons is that our electron microscopy construct lacks much of 
the receptor’s intracellular domain, a region that has a role in subunit 
assembly and ion conductance in the structurally homologous 5-HT3 
receptor!??°, 

Analysis of the transmembrane pore reveals that both 0482 receptor 
assemblies taper to a constriction point created by glutamates present 
in all subunits, located at the —1’ position of the M2 a-helices, where 
the pore interfaces with the cytosol (Fig. 4c). The 30:28 assembly has 
a smaller constriction point, a diameter of 2.4 A compared to 3.2A 
(Fig. 4d and Extended Data Fig. 10), which is consistent with the more 
tightly packed a-a interface compared to the 3-( interface in the 20:36 
assembly (Fig. 2e and Extended Data Figs. 7, 8). Early studies on per- 
meation through different channel classes suggested that permeation 
through nicotinic receptors (unlike for K* channels) occurs with the 
ion at least partially hydrated”!. The smallest physiologically permeant 
ion is Na*, with a crystal diameter of about 1.9 A. Addition of a single 
equatorial water molecule (diameter 2.8 A) raises the diameter of the 
permeant species above the size of both constriction points. With the 
caveat that the structure was determined in the presence of detergent 
and is not in a native lipid membrane, the constriction diameter is 
consistent with the pore representing a non-conducting, agonist-bound 
desensitized state. 

Reconstitution of nicotinic receptors into defined lipid preparations 
has demonstrated that cholesterol is important for nicotinic receptor 
function’. We tested cholesterol derivatives in thermal stabiliza- 
tion assays”* and found that the water-soluble cholesterol analogue 
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Fig. 4 | Pore and conductance determinants. a, Cutaway of the 30:28 
assembly showing the permeation pathway coloured by electrostatic 
potential. b, As in a, but for the 20:36 assembly. c, M2 a-helices from 
opposing a4 (green) and 82 (blue) subunits from the 3a:28 assembly with 
sidechains shown for pore-lining residues. Purple spheres indicate pore 
diameters over 5.6 A; green are 2.8-5.6 A; red are less than 2.8 A. d, Pore 
diameter for the «482 receptor 20:38 and 3:28 assemblies from the 
cryo-EM study compared to the crystal structure of the 20:36 assembly 
(Protein Data Bank (PDB) accession 5KXI*). The zero value along 

the y-axis of the plot is aligned with the a-carbon of the —1’ position. 
Uncertainty in the position of most acidic sidechains in EM density maps 
adds ambiguity to the constriction point diameters defined by the — 1’ 
glutamates. 


cholesteryl-hemisuccinate (CHS) potently stabilizes the receptor; thus, 
we included it throughout purification of the receptor. We identified 
two putative cholesterol molecules per subunit in the electron micro- 
scopy density maps (Extended Data Fig. 11). These sausage-shaped 
densities are found in pairs located at the receptor periphery along 
the intracellular half of the transmembrane domain, where they flank 
the subunit interfaces. When viewed at the interface, each putative 
cholesterol molecule interacts almost exclusively with one subunit and 
one adjacent cholesterol molecule. Each pair of cholesterol molecules 
shares a single amino acid contact that bridges these intrasubunit sites 
(Extended Data Fig. 11c, d). These intrasubunit sites contrast with 
intersubunit binding sites found for cholesterol and potentiating neu- 
rosteroids in GABA, receptor chimaeras**”°, and are distinct from the 
intrasubunit site identified for inhibitory neurosteroids ina GABA, 
receptor chimaera” (Extended Data Fig. 11f). Comparison of the 
cholesterol positions in the «482 structures suggests that the orienta- 
tions of the two molecules at an interface are dictated by the identity 
of the subunit on the principal side. When the principal subunit is 
82, the cholesterols are oriented parallel to the pore axis; however, 
when the principal subunit is «4, the apical ends of both molecules 
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tilt towards the principal face. The cholesterol binding site on the 
principal subunit is a bowl-shaped surface formed by the junction of 
the M3, MX and M4 helices. Close examination of this binding site 
reveals only a single amino acid position that is non-isosteric between 
the two subunit types, located on the penultimate helical turn in the 
M3 transmembrane helix. In 82 subunits this residue is a cysteine 
(C292) and in «4 subunits it is a phenylalanine (F300; Extended Data 
Fig. 11c-e). The bulky phenylalanine residue is likely to prevent access 
to the apical portion of the binding region observed when (2 sub- 
units form the principal face. Analysis of human Cys-loop receptor 
sequences reveals that a small hydrophobic residue that would allow 
this axial binding mode is conserved in neuronal 2 and $4 subunits as 
well as the muscle-type 31 subunit. Meanwhile, in all a-subunits from 
cation-selective receptor types, a bulky hydrophobic residue (FE I, L or M) 
is conserved at this position. Our assignment of this cholesterol 
binding site is consistent with mapping studies using photo-activatable 
cholesterol analogues”*”’, 

Here we used single particle cryo-EM coupled with antibody frag- 
ments to determine structures of both stoichiometries of the «482 
nicotinic receptor from one experimental preparation. Comparison 
of the structures reveals principles underlying subunit assembly and 
the differences in permeation properties between the two stoichio- 
metries. We observed agonist binding at the non-canonical a-a 
subunit interface, differentially stabilized by residues unique to this 
interface class. The structure further allowed mapping of putative 
cholesterol sites on the surface of the transmembrane domain near 
the cytosolic junction. This study provides a framework for further 
biophysical analysis of the allosteric interfacial sites, therapeutic 
targeting of these sites with modulators akin to benzodiazepines, and 
more broadly, a general approach for resolving the structural biology 
of heterogeneous assemblies. 
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METHODS 


Nicotinic receptor expression and purification. The human «4 and £2 nicotinic 
receptor genes were provided by J. Lindstrom at the University of Pennsylvania. 
Deletion constructs used for expression and structural analysis lack a large por- 
tion of the intracellular domain of the receptor, as previously described®. These 
genes encode the native signal peptides, residues 1-338 and 556-601 in the a4 
subunit and residues 1-330 and 417-477 in the 82 subunit (residue number- 
ing here is for the wild-type mature, signal-peptide-cleaved protein sequence). 
To promote good biochemical behaviour a Glu-Arg linker was inserted in the 
MX-M4 junction, between Phe559 and Ser560 in the a4 subunit and between 
Gln420 and Ser421 in the 82 subunit. For purification purposes a Strep-tag was 
inserted at the C terminus of the 82 subunit preceded by a Ser-Ala linker. The 
genes were subcloned into the pEZT bacmam expression vector’® and viral titra- 
tion and protein expression in GnTI- HEK cells (ATCC CRL-3022, not myco- 
plasma tested) were performed as previously described”* with modifications. 
We transduced 3.21 of suspension GnTI- cells with multiplicities of infection 
(MOIs) of 0.5:0.5 for the a4 and 32 subunits, respectively, to promote expres- 
sion of two receptor assemblies. Nicotine (Sigma-Aldrich) and sodium butyrate 
(Sigma-Aldrich) were added at the time of transduction to 0.1 mM and 3 mM, 
respectively. At the time of transduction, suspension cells were moved to 30°C 
and 8% COp. Cells were collected after ~72h by centrifugation, resuspended 
in 20mM Tris, pH 7.4, 150mM NaCl (TBS buffer), 1 mM nicotine and 1mM 
phenylmethanesulfony! fluoride (Sigma-Aldrich), and disrupted using an Avestin 
Emulsiflex. Lysed cells were centrifuged for 15 min at 10,000g; supernatants con- 
taining membranes were centrifuged for 2h at 186,000g. Membrane pellets were 
mechanically homogenized and solubilized for 1h at 4°C in a solution containing 
TBS, 40 mM n-dodecyl-$-p-maltopyranoside (DDM; Anatrace), 1 mM nicotine 
and 0.2mM CHS (Tris salt, Anatrace). Solubilized membranes were centrifuged 
for 40 min at 186,000g then passed over high capacity Strep-Tactin (IBA) affinity 
resin. The resin was washed with Size Exclusion Chromatography (SEC) buffer 
containing TBS, 1mM DDM, 1 mM nicotine, 0.2 mM CHS and 1mM TCEP 
(Thermo Fisher Scientific) and eluted in the same buffer containing 5mM 
desthiobiotin (Sigma-Aldrich). 

Generation of monoclonal antibodies and Fab fragments. The 12H2 mono- 
clonal antibody (mAb) against the 0482 nicotinic receptor (IgG1, «) was raised 
by D. Cawley (Monoclonal Core, Vaccine and Gene Therapy Institute, OHSU) 
using standard methods. Hybridomas were generated by immunization of mice 
(female BALB/c) with 0482 receptor protein purified in DDM, followed by fusion 
of splenocytes with mouse myeloma cells and selection of clones by enzyme-linked 
immunosorbent assay. The mAb was purified from hybridoma cell supernatant by 
cation exchange and protein A affinity chromatography. High affinity and specificity 
of the antibody for properly folded pentameric «482 receptor was assessed using 
Fluorescence-SEC (FSEC) with GFP-tagged receptor (shift in elution volume) and 
by western blot (no binding). The top candidate antibodies were then more thor- 
oughly assessed using FSEC monitoring tryptophan fluorescence with purified 
0482 receptor and Fab fragments to ensure fidelity of the «482-Fab complex and to 
estimate receptor:Fab stoichiometry. 0482-Fab complexes were co-purified, then 
re-injected in analytical FSEC to test for dissociation under these dilute conditions 
over a Sepax SRT SEC-500 (4.6 x 300 mm) gel filtration column. Fabs that did not 
dissociate were qualitatively deemed to have sufficiently high affinity to use in 
EM studies. The 12H2 Fab fragment used here was generated by papain cleavage 
of whole antibody at a final concentration of 1 mg/ml for 2h at 37°C in 50mM 
NaPOx,, pH 7.0, 1mM EDTA, 10mM cysteine and 1:100 (w:w) papain. Digestion 
was quenched using 30 mM iodoacetamide at 25°C for 10 min. Fab was purified 
by cation exchange using a Hi-Trap SP column in 10mM sodium citrate pH 5.0 
and a NaCl gradient elution. Cloning and sequencing of Fab heavy and light chain 
genes were performed from mouse hybridoma cells. 

Cryo-EM sample preparation. Purified 0482 from affinity chromatography was 
mixed with Fab in a 1:2 (w:w) ratio and injected over a Superose 6 Increase 10/300 
GL column (GE) equilibrated in SEC buffer. Peak fractions were assayed by SEC, 
monitoring tryptophan fluorescence. The peak fraction was concentrated fourfold 
to an A280 of 5.9 in SEC buffer, ultracentrifuged, and used directly to prepare 
EM grids. Three microlitres of purified «482-nicotine-Fab complex was applied 
to glow-discharged gold R1.2/1.3 300 mesh holey carbon grids (Quantifoil) and 
immediately blotted for 4s at ~100% humidity and 4°C, then plunge-frozen into 
liquid ethane cooled by liquid nitrogen using a Vitrobot Mark IV (FEI). 
Cryo-EM image collection and processing. Two separate data sessions were 
taken on a Titan Krios electron microscope (FEI) operated at 300 kV. Images were 
recorded on a K2 Summit direct electron detector (Gatan) equipped with GIF 
quantum energy filter (20 eV) (Gatan) in super-resolution mode (super-resolu- 
tion pixel size: 0.535 A per pixel). In the first session, 3,323 images were collected 
and each micrograph in the first session was exposed for 15s with a dose rate 
of ~5 e/A?/s (total specimen dose, ~75 e/ A’), and 30 frames were captured 
per micrograph. In the second session, 2,588 images were collected and each 


micrograph in the second session was exposed for 10s with ~5 e-/A?/s dose rate 
(total specimen dose, ~50 e~/A?), and 40 frames were captured per micrograph. 
Images were recorded using the automated-acquisition program EPU (FEI) with 
defocus values varied from 0.5 1m to 4.5 1m. 

Dose-fractionated images (movies) were gain normalized, 2 x Fourier binned 

(resulting in a pixel size of 1.07 A per pixel), aligned, dose-weighted and summed 
using Unblur”’. Defocus values were estimated using GCTF*”. A total of 5,166 
micrographs were manually selected and used for further analysis (2,836 out 
of 3,323 from the first session and 2,330 of 2,588 from the second session). 
Approximately 100 particles were manually picked and subjected to reference-free 
2D class averaging in RELION 2.1°!. Resulting references were then used for 
auto-picking in RELION 2.1 for a subset of 10 images, then the classification 
process was repeated to obtain good references for auto-picking on all images. 
Following auto-picking, micrographs were manually inspected, and false-positive 
particles were manually removed. Curation of auto-picked particles resulted in a 
total of 649,773 particles. 516,450 particles were selected following 2D classification 
in RELION. Two-hundred particles from 30 different 2D classes (6,000 particles in 
total) were used to generate an ab initio model in RELION. 3D classification using 
eight classes resulted in six good classes (403,246 particles) that could be divided 
into to two distinct categories based on Fab occupancy and subunit-defining fea- 
tures, representing the 30:28 (139,551), and 2:36 (263,695) stoichiometries of the 
0482 receptor. 3D refinement of these two pooled particle sets yielded reconstruc- 
tions of the overall receptor-Fab complexes at ~4.0 A resolution. The Fab molecules, 
in particular the peripheral constant domains, were poorly resolved, and of less 
biological interest to us, so we focused our effort on the receptor. Removing the Fab 
regions from the global resolution calculations results in reconstructions for the 
30:28 assembly at 3.7 A and for the 21:38 assembly at 3.4 A resolution, consistent 
with side chain features throughout the density maps. All global resolutions were 
estimated by applying a soft mask around the protein density and the gold-standard 
Fourier shell correction (FSC) = 0.143 criterion. ResMap*? was used to estimate 
the local resolution. 
Model building, refinement and validation. The higher resolution 20:38 
assembly was built first. The X-ray coordinates (PDB accession code 5KXI) 
were docked into the density map. Secondary structural features that are dis- 
tinct between the two stoichiometries (loop C and a1 helix, Fig. 1) were used to 
guide the initial rigid body fit using UCSF Chimera™. SwissModel*° was used 
to generate a homology model of the Fab light chain using PDB entry 4TPR and 
for the heavy chain using PDB entry 2ZJS. These chains were docked into the 
EMD density at one Fab site. Manual adjustments of the receptor-Fab structure 
were then performed in Coot*®. The ECD and transmembrane domain (TMD) 
halves of each subunit, and the variable and constant halves of each Fab, were 
rigid body fitted into the density map. The epitope binding loops of the Fabs 
were rebuilt into unambiguous density at the site of interaction with the receptor, 
and then this first Fab molecule was copied into the two additional sites in the 
2:38 assembly. The map quality for the receptor exceeded that from the earlier 
X-ray structure, which allowed for more residues to be built at the N termini and 
in the linkers connecting the MX amphipathic helix with the M4 helix. There 
was unclear density and thus a gap in the connectivity in the MX-M4 linker for 
7 residues in a4 subunits and 10 residues in 32 subunits. There was improved 
density for side chains and better connectivity in the ECD loops, in particular 
loop C in the 8-subunits. After rebuilding in Coot, global real space coordinate 
and B factor refinement were performed in Phenix’. 

The cryo-EM density maps drew our attention to a notable conformational 
difference in the loop C of 82 subunits compared to that modelled in the crystal- 
lographic structure’. We observed continuous density for this substructure in all 82 
subunits in EM maps that resolve loop C in a conformation bending out and away 
from the subunit interface (Fig. 3 c, d and Extended Data Fig. 6a—c). This confor- 
mation was independent of whether Fab was bound to the adjacent (—) subunit 
or not (Extended Data Fig. le, f). In the crystal structure, loop C in 82 subunits 
was modelled in a conformation packed against the (—) subunit, roughly akin to 
the conformation of loop C in a4 subunits. The crystallographic modelling was 
based on clear electron density for loop C in one $2 subunit that was at a crystal 
contact (Extended Data Fig. 6a—c). We thus suggest that an open conformation of 
loop C in 82 subunits is the proper one. 

The validation to test for overfitting of the model was performed as previously 
described**. In brief, the atom positions of the final refined model were randomly 
displaced by a maximum of 0.5 A using PDBSET in the CCP4 suite*®. This per- 
turbed model was then refined in Phenix in real space against the first half map of 
the reconstruction comprising 50% of the particles. A map versus model FSC com- 
parison was made for this model versus the map used in its refinement (‘work’), as 
well as the same model versus the half map not used in refinement (‘free’). The FSC 
curves of map versus model agree well (Extended Data Figs. 4, 5). 

Sequences used in alignments were retrieved from the UniProtKB database“. 
Sequence alignments were made using PROMALS3D*". Subunit interfaces includ- 
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ing free energies were analysed using the PDBePISA server**. Pore diameters were 
calculated using HOLE. Domain motions were analysed using DynDom“. 
Ligand interactions were characterized by visual inspection and analysis using 
LIGPLOT*. Structural figures were made using UCSF-Chimera*4 and PyMOL 
(Schrodinger, LLC) including the APBS electrostatics plugin“*. Structural biology 
software packages were compiled by SBGrid’”. 

Density maps were displayed in Chimera at a threshold of 0.0306 in Figs. 1, 2. 
In Fig. 3, ligand density is displayed in Chimera at a threshold of 0.0403 for the 
a-a interface, and 0.05 for the «-8 interface. 
Radioligand binding. Experiments to test the effect of Fab on [7H]-nicotine 
(PerkinElmer, 79.8 Ci/mmol) binding to the 0482 receptor were performed 
with protein purified as for EM (that is, a mixture of stoichiometries) but 
in the absence of Fab and nicotine. The concentration of binding sites was 
kept at 2.5nM after a preliminary experiment to determine optimal receptor 
concentration. For the + Fab samples, Fab was added in excess (51M). In addi- 
tion to the protein, the binding assay conditions included 20 mM Tris pH 7.4, 
150mM NaCl, 1mM DDM, and 1 mg/ml streptavidin- YiSi scintillation prox- 
imity assay beads (SPA; GE Healthcare Life Sciences). Non-specific signal was 
determined in the presence of 1001.M ['H]-nicotine; all data shown are from 
background-subtracted measurements. Experiments were performed in tripli- 
cate. All data were analysed using Prism 6 software (GraphPad) with variable 
Hill slope. 
Electrophysiology. The open reading frames of a4 and $2 EM constructs 
described above were subcloned into the pGH19 vector*’. CRNAs of «4 and 82 
constructs were synthesized in vitro using the MEGAscript T7 Transcription Kit 
(Thermo Fisher). Xenopus laevis ovaries purchased from NASCO (LM00935MX) 
were treated with 1-2 mg/ml collagenase I (Gibco) in Barth’s solution (in mM, 88 
NaCl, 1 KCl, 2.4 NaHCOs, 0.82 MgSOu, 0.33 Ca(NO3)2, 0.68 CaCl, 10 Hepes, 
pH 7.4) for 1-1.5h, and washed with ND96 solution (in mM, 96 NaCl, 2 KCl, 1 
MgCl). 1.8 CaCl», and 5 Hepes pH 7.4). Stages V-VI oocytes were collected and 
kept at 17°C in ND96 solution with 25 |1.g/ml ampicillin and 0.1 mg/ml gentamycin. 
Oocytes were injected with 1-10 ng cRNA mixture in the ratio 1:5 or 5:1 
of «4:82. Two to three days after injection of CRNA, receptor activity was meas- 
ured by two-electrode voltage clamp recording in ND96 solution using Oocyte 
Clamp OC-725 amplifier (Warner Instruments), and the signal was digitized with 
a Digidata 1440 A (Molecular Devices). Data were analysed using Clamp 10.3. 
Pipettes were filled with 1 M KCl and showed resistances of 0.8-2.0 MQ. Ligands 
were dissolved in ND96. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 
Data availability. Atomic coordinates of the two 0482 receptor-nicotine-Fab 
complexes have been deposited in the Protein Data Bank (PDB; 20:38, 6CNJ; 
3:28, 6CNK). The cryo-EM density maps have been deposited in the Electron 
Microscopy Data Bank (20:38, 7535; 30:28, 7536). 
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Extended Data Fig. 1 | See next page for caption. 
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Extended Data Fig. 1 | Biochemistry, binding, electrophysiology and 
0.4682-Fab interactions. a, Size-exclusion chromatogram of 0482-Fab 
complex and SDS-PAGE analysis of complex purification (Vo, void 
volume). b, Saturation binding experiments with [*H]-nicotine anda 
mixture of receptor subunit stoichiometries. Grey squares denote samples 
with receptor plus Fab and blue circles denote samples with receptor alone. 
Receptor alone Ka=7.7 nM (95% confidence interval (CI), 6.9-8.6 nM) 
and receptor plus Fab Ka=17.0nM (95% CI, 10.2-26.6 nM). Receptor 
alone and receptor plus Fab both exhibited a Hill slope of approximately 

1 (1.14 and 1.09, respectively). Plotted results are from a representative 
experiment performed in triplicate. c, Representative two-electrode 
voltage clamp (—60 mV) recordings of oocytes injected with cRNAs for 
the a4 and 82 subunits in ratios to bias assembly, 10:58 (top two traces) 
and 5a:18 (bottom two traces) for the 20:38 and 3a:28 assemblies, 
respectively. The experiments were performed in the presence and 
absence of Fab to assess the effect of Fab on receptor gating. Each oocyte 
was perfused with 11M nicotine solution for about 2 min, washed with 
bath solution for 10 min and then perfused for about 2 min with 11M 
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nicotine + Fab. For samples where Fab was included, 1 1M final [Fab] was 
added directly to the oocyte bath following the first perfusion and allowed 
to incubate for 1.5 min before perfusing with a second solution containing 
1M nicotine + 0.1 {1M Fab. d, Bar graph quantifying peak currents before 
and after adding Fab. Currents were normalized to the amplitude of the 
first nicotine application. Appreciable current rundown was observed 
(assessed by applying nicotine without Fab as the second application). 
Change in peak current is similar in the presence and absence of Fab for 
both stoichiometries, suggesting no substantial effect on gating. n, number 
of oocytes; error bars, s.d. from mean. e-h, Fab-3-subunit interactions at 
the a-, 8-6, a-B, and a-a interfaces, respectively. With one exception, 
the Fab molecules interact exclusively with a single 3-subunit. The 
exception is at the 6- interface, where the Fab on the complementary 
8-subunit forms one potential interaction with the preceding B-subunit. 
This interaction is displayed as a dashed line (f, inset). The conformation 
of the principal loop C at the 6-( interface is indistinguishable from those 
where Fab is not interacting, suggesting that Fab does not affect the loop C 
conformation. Subunits are coloured as in Fig. 1 
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Extended Data Fig. 2 | Cryo-EM image processing procedure. images (top/bottom and side views). b, Images of selected 2D classes 
a, Representative micrograph of the a482-Fab complex (scale bar,50nm). —_ from reference-free 2D classification by RELION. White arrowheads, Fab 
Boxed regions on the right are magnified to show representative particle fragments. c, Overview of the image processing procedure (see Methods). 
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Extended Data Fig. 3 | Three-dimensional reconstructions of the 
20:36 and 3a:26 assemblies. a, Angular distribution histogram of 20:38 
assembly projections. b, FSC of 2«:38 assembly maps before (black) 

and after (blue and red) masking. Two soft masks were used: one (red) 
mask that included only the receptor and one (blue) mask that included 
the whole 20:36-Fab complex. When a mask was used, the FSC curve 


was corrected for masking effects during the RELION post-processing 
procedure. Masks used in FSC calculations are shown in c for the whole 
20:38-Fab complex (top) and receptor only (bottom), superimposed on 
the map in question. d, Local resolution of the 2a36-Fab reconstruction 
estimated by ResMap**. Shown is the combined map, which has not been 
sharpened or filtered. e-h, As in a—d but for the 30:26 assembly. 
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versus summed map (full) in black, for model versus half map (work) in spanning regions (right). Regions are numbered and helices are labelled. 
red, and for model versus half map not used for refinement (free) in blue. Maps were sharpened with a single B factor at —150 A?. d, As inc, but fora 
For validation of receptor alone (without Fab), maps were segmented to representative 82 subunit. 
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Extended Data Fig. 5 | 20:36 assembly model-map validation. a, b, FSC 
curves for cross-validation between the maps and the models with (a) and 
without Fab fragments (b) for the 20:38 assembly. Curves for final model 
versus summed map (full) in black, for model versus half map (work) 

in red, and for model versus half map not used for refinement (free) in 
blue. For validation of receptor alone (without Fab) maps segmented to 
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exclude the Fab fragments were used. c, EM density segments of the 20:38 
assembly for a representative 04 subunit. Density map and model for a4 
subunit (left). Representative density for extracellular and transmembrane 
spanning regions (right). Regions are numbered and helices are labelled. 
Maps were sharpened with a single B factor at —170 A2. d, Asin, but fora 
representative 82 subunit. 
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Extended Data Fig. 6 | Comparison of map and model of 2038 by 
X-ray crystallography at 3.9 A resolution and by cryo-EM at 3.4A 
resolution, and comparison of subunit backbone conformations. 

a, The X-ray electron density map (left) and cryo-EM map (right) are 
displayed in a linear representation (unwrapped) to illustrate the overall 
resolvability of all five subunits: two a4 subunits (green) and three 32 
subunits (blue). Red circles indicate the locations of crystal contacts on the 
X-ray electron density map (left) or Fab binding sites on cryo-EM density 
map (right). b, X-ray structure of the 20:38 assembly (PDB accession 
5KXI®) after alignment to the cryo-EM structure (this study) shown in 
ribbon representation and coloured according to Ca r.m.s.d. values (from 
blue (low) to red (high)). While the areas involved in crystal contacts 
displayed relatively higher r.m.s.d. values for Co (> 2A), the Fab binding 
location shows r.m.s.d. values for Ca of < 1 A. This result implies that 
zones of protein-protein contacts in the crystal lattice may substantially 
affect local structure on the pentamer, but Fab binding in the cryo-EM 
study did not result in substantial structural change, consistent with the 
binding assays and electrophysiological results (Extended Data Fig. 1). 

c, Fit of X-ray model (5KXI) to X-ray electron density map (left) and 
cryo-EM model to cryo-EM density map (middle) shown as selected areas 
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for high r.m.s.d. (loop C) and low r.m.s.d. (M2 helix), respectively. Right 
column shows superposition of X-ray and cryo-EM model at the selected 
areas. d-h, Subunit superpositions. d, Comparison of all «4 subunits in 
the 20:38 and 3a:28 assemblies reveals no substantial conformational 
differences in a-carbon backbones. e, Superposition of all 82 subunits in 
the two assemblies reveals a domain motion of the TMD relative to the 
ECD of the subunits that comprise the 3-( interface. The rotation of 4.4° 
between the two 8-subunits that compose the 8-( interface is denoted by 
black double-headed arrows. The principal 3-subunit (82 + ) is coloured 
magenta while the complementary B-subunit (82 —) is shown in cyan. 
f-h, Superposition of «4 (green) with the three distinct 82 conformations: 
principal 8-8 (magenta) complementary 6-8 (cyan) and a-B-a (grey). 
The conformation of the principal 8-6 (magenta) subunit is distinct from 
every other subunit in the 20:38 and 3:28 assemblies. The conformation 
of the complementary (—) subunit of the 6-( interface is rotated towards 
the pore axis, accommodating the marked conformational change of the 
principal 82 subunit away from the pore axis. This conformational change 
results in the complementary 8-( (cyan) subunit adopting a backbone 
conformation similar to a4 subunits. 
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Extended Data Fig. 7 | Fenestration at 8-6 interface. Extracellular fenestration shown as sticks. Two conserved glutamic acid residues 
fenestration unique to this class of interface. Inset of fenestration in close proximity to this gap at the B- interface (yellow) have been 
with distances indicated by dashed lines and side chains surrounding implicated in Ca?* potentiation of nicotinic receptors’. 
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Extended Data Fig. 8 | Basis of «482 heteromeric assembly. 

a, Cartoon representation of top view of observed (20:38 and 30:28) and 
computational (40:18, 10:48, 5a and 53) «482 pentameric assemblies. 
Assemblies on the top row (30:28, 40:18, and 5c) are arranged by 
increasing a4 composition. Assemblies on the bottom row (20:38, 1a:48 
and 53) are arranged by increasing $2 composition. Agonist binding sites 
are denoted by red circles. Buried interface areas (A?) for the interfaces 
analysed in b, c are listed below each pentameric assembly. Subunits are 
coloured in a as described below for b, c. b, Superposition of a-a from 
30:26 and final a—a interface in 5a homopentamer. Principal subunits 


(grey) were superimposed to highlight differences at the interface. 
Complementary (—) subunits are coloured light green for a-a from 
30:28 and dark green for the a-a interface in the 5« homopentamer. 
Sticks are displayed for amino acid clashes that have greater than 1.5A 
overlap assessed by Molprobity. c, Superposition of the 8-6 interface from 
2:38 and final 6-6 interface in the 53 homopentamer. Principal subunits 
(grey) were superimposed to highlight differences at the interface. 
Complementary (—) subunits are coloured light blue for the 8-( interface 
from the 20:36 assembly and dark blue for the final B—-( interface in the 58 
homopentamer. 
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Extended Data Fig. 9 | Ligand binding site comparisons. a, Stereo 
images of nicotine bound at the a-a interface. Subunits are coloured as 

in Fig. 1. Density (purple mesh) is displayed at a threshold of 0.0306 in 
Chimera. Electrostatic interactions denoted as dashed magenta lines. 
Binding pocket residues and nicotine displayed as sticks. b, Ligplot of 
nicotine bound at the a-a and a-( interfaces (left and right, respectively). 
Residue contacts within 4.5 A are displayed in red, residue displayed in 
purple (His116) is within 5.5 A. c, Stereo images of nicotine bound at the 
a-f interface. Density (purple mesh) is displayed at a threshold of 0.0306 
in Chimera. Hydrogen bonds denoted as dashed magenta lines. Binding 
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pocket residues and nicotine displayed as sticks. d, e, Stereo image overlays 
of nicotine bound at the a—-a (magenta) and a-68 interfaces (cyan). Ligand 
density is displayed as transparent surfaces. Density is displayed at a 
threshold of 0.406 for the a—-a (magenta) and 0.500 for the a-8 (cyan) 
interface in Chimera. e, Ribbon removed for clarity, rotated 180° relative 
to d. Key residues implicated in the difference in sensitivity between the 
classical a-$ neurotransmitter binding site (high sensitivity V111/F119/ 
L121) and the unique a-a binding site (low sensitivity H116/Q124/T126) 
are labelled. 
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Extended Data Fig. 10 | Comparison of ions located in the pore above 
the constriction point. a, Top view of EM density map of ion in the pore 
of the 3«:28 assembly. Density (blue mesh) is displayed at a threshold of 
0.027 in Chimera. Subunits are coloured as in Fig. 1. Modelled Na* ion is 
represented as a purple sphere. Nearest residues on the M2 a-helices are 


a4 


B2 


indicated. b, As a, but for the 20:38 assembly. c, Side view of EM density 
map of ion in the pore of the 3:23 assembly. Two subunits (one « and one 
8) removed for clarity. Colours and residues indicated as in a. d, As c, but 
for the 20:38 assembly. e, Superposition of constriction region of 20:38 
(light blue) and 30:26 (light green) assemblies. 
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Extended Data Fig. 11 | See next page for caption. 
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Extended Data Fig. 11 | Comparison of putative cholesterol binding 
orientations. a, b, EM density map showing cholesterol sites from all 
TMD interfaces in the 30:26 assembly and 2«:38 assembly, respectively. 
Pentagons (top) are coloured to illustrate subunits composing the 
displayed interface. c, Stereo image of representative cholesterol binding 
site with a-subunit on principal side (+). When the principal side (+) is 
an a4 subunit, both molecules tilt towards the principal face. One residue 
from the complementary side 82 (—), Y232, contacts both cholesterol 
molecules at the a-( interface (at the a-a interface the equivalent residue 
¥240 in the complementary (—) a4 contacts both cholesterol molecules 

at the interface; not shown). d, Stereo image of representative cholesterol 
binding site with B-subunit on principal side (+ ). When the principal side 
(+) is a2 subunit, both cholesterol molecules are oriented orthogonal 

to the plane of the membrane. One residue from the principal (+) 2 


LETTER 


subunit, V288, makes contacts with both cholesterol molecules at the 8-a 
and 6-6 (not shown) interfaces. Cholesterol and interacting side chains are 
shown as sticks. Density maps in panels a—d are displayed at a threshold of 
0.025 in Chimera. e, Alignment of nicotinic receptor subunits for region 
encompassing putative cholesterol sites. Residues in cholesterol binding 
pocket are boxed and amino acid position implicated in differential 
cholesterol binding is highlighted in magenta. Uniprot accession IDs are 
provided*®. Residues mapped using a photoreactive cholesterol analogue 
in Torpedo californica (TC P02710)*°*’ are highlighted in cyan. All other 
sequences are from Homo sapiens (HS P43681, P11230, P17787, Q05901, 
P30926 for a4 and 61-4, respectively). f, Superposition of B-a (yellow) 
from a482 and al-cal (blue) GABA,-GLIC chimaera (PDB accession 
5OSC**) TMD interface to compare putative binding sites. Pregnenolone 
sulfate, CHS and cholesterol are shown as sticks. 
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Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 


Dataset 


Data collection and processing 
Magnification 
Voltage (kV) 
Electron exposure (e-/A*) 
Defocus range (um) 
Pixel size (A) 
Symmetry Imposed 
Initial number of particles 
Final number of particles 
Map resolution (A) 

FSC threshold 


Refinement 
Intial model used (PDB code) 
Model resolution (A) 

FSC threshold 
Map-sharpening B factor (A) 
Model composition 

Non-H atoms 

Protein residues 


B factors (A’) 
Receptor 
Fab 
Glycosylation 
Ligand 
Cholesterol 
lons 

Validation 


MolProbity score 
Clashscore 
Poor rotamers (%) 
R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 
Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 
*Mask including Fab 


*Mask including Fab. 


2a3B stoichiometry 
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46,730 
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117 


1.9 (100" percentile) 
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-0.5 to -4.5 


1.07 
C1 
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139,551 
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0.5 
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22,082 
2,757 
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27 


1.98 (100"" percentile) 
13.6 
0.16 


0.007 
0.69 


95.1 
4.9 
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Out of the darkness 


Depression and related disorders can be devastating, but there are ways to fight them. 


ental illness is widespread in the 
M sciences, and graduate students are 

particularly vulnerable (Nature 556, 
5; 2018). Building a strong support network 
and separating your sense of self from your job 
are crucial. Here, in the second of our three- 
part series on mental health in academia, four 
researchers share their 
stories and advice. Next 


week, we examine lab Read more on mental 
leadership and and how _healthinscience at 
to improve the health of 


research groups. 


‘Crazy thoughts’ 


Lawyer and psychoanalyst at the 
University of Southern California Gould 
School of Law, Los Angeles 


It was 1982, and I was in my first semester as 
a student at Yale Law School. We had a paper 
due and I climbed onto a roof, singing, dancing, 


gesticulating and saying crazy things, like Ihad 
killed people with my thoughts. My friends 
called the student health centre and tried to get 
me there, but I wouldnt go. 

The next morning, I asked for an extension 
on the paper. I was still saying crazy things, and 
my professor took me to the emergency depart- 
ment. They committed me. I was hospitalized 
for five months, restrained, forcibly medicated, 
given very little privacy and locked up. 

If you were withdrawn from Yale for 
psychiatric reasons, you had to be evaluated for 
readmission by the head of university health 


10 MAY 2018 | VOL 557 | NATURE | 267 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


> psychiatry. I looked this person up and found 
an article hed written on exactly this topic — the 
questions that the evaluator should ask, and the 
answers to look for. I was totally prepared, and 
it unfolded exactly as he had laid it out. 

Still, part of me felt like I would never get back 
on my feet. The guy advised me to become a 
cashier for two or three years. I asked myself 
how much more stressful it would be to stand at 
acash register dealing with a long line of people 
waiting for change from their purchase. So I 
went back to Yale. 

I was eventually diagnosed with schizo- 
phrenia, and it took me ten years to become 
reconciled to staying on medication. Once I 
did, my life got much better. In a work envi- 
ronment, it also helps if you have people who 
know what you're going through. But some 
people will think you're not up to the job or 
are dangerous or scary. You've got to pick your 
supporters carefully. 

And stigma is a real scourge. When my 
memoir, The Center Cannot Hold: My Journey 
Through Madness, came out in 2007, an admin- 
istrator on the staff at the University of Southern 
California Gould School of Law told me that she 
was glad she didn’t know I had schizophrenia 
when we started going to dinner together. She 
said she'd never have gone. I was stunned that 
a smart, kind, well-meaning person would 
have such a picture of mental illness that she 
wouldn't have gone to dinner with me. 

I've learnt that my mind is my best friend and 
my worst enemy. When I’m working, the crazy 
thoughts recede. 


NATHANIEL BORENSTEIN 
‘My life sucked’ 


Veteran computer scientist and Internet 
pioneer in Greenbush, Michigan, 

who helped to create e-mail and the 
precursor to PayPal. 


When I was in graduate school in the 1980s, 
studying the theory of computation, my adviser 
— who had been very encouraging — suddenly 
started indicating that he didn't understand 
what I was doing. I got very depressed and 
almost left the programme. I didn’t perceive 
my reaction as symptomatic of an illness. I just 
thought that the whole world sucked, that my 
life sucked and that everything was wrong. 

I get depressed a couple of times a decade, 
and my depressions tend to look the same. I 
spend a lot of time curled up in bed, not wanting 
to talk to anyone. Everything looks bad. 

I think that the better you know your 
pattern, the more likely you are, and the 
quicker you are, to recognize it. When I start to 
recognize my pattern, it’s comforting to know 
that the world is not completely horrible and 
that I’m just depressed. I immediately make an 
appointment with my therapist. I want to be 
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open about this, so that when younger people 
read this, they can be spared that response. At 
least if they know it’s an illness, when they're 
curled up in a ball, it’s better than thinking that 
the whole world sucks. 

There is a tremendous resistance to taking 
psychiatric drugs, but I would encourage 
people who are struggling with depres- 
sion to consider them. In the early 1990s, 
I started antidepressants and that was a 
godsend. Within a couple of hours, I was 
feeling better. And a few days after that, I 
met someone I totally clicked with profession- 
ally and we founded a new digital company, 
the predecessor to PayPal. 

The times that you might think were my 
biggest triumphs, when you might have 
expected me to be dancing around and doing 
a jig — those are the 


times I’ve tended “J choose one 

to get extremely path and look 
depressed. It's because at someone who 

I have accomplished has gone down 
this thing and it was a different ath 
a lot of work — but P 
and I feel like I’ve 
it was nowhere near 


what I thought I fallen short.” 


ought to have accom- 

plished. I choose one path and look at someone 
who has gone down a different path, and I 
feel like I’ve fallen short. 

My most recent episode was in January and 
February of this year. I went to the hospital 
with constant, extreme abdominal pain, 
which was making me very depressed, and 
was diagnosed with colitis. That was the first 
time I'd thought of suicide as a way to end the 
pain. Since then, I have got help and the pain 
is under control. 

I’ve heard that depression is a gift, because 
it can help us to recognize when we need to 
change something. But if anyone had told me 
in 1981 that my depression was a gift, I would 
not have believed it for an instant. I once came 
this close to destroying my career. 


ANONYMOUS 
‘Know your limits’ 


Male molecular biologist in the 
United States. 


I was in graduate school in South America. I was 
frustrated there and had a period of depression. 
My principal investigator (PI) suggested I see 
a psychiatrist, who gave me medication for 
bipolar disorder, and it helped. I also realized 
that graduate school would be over soon and 
that I'd be able to move on to the next thing. 

In January 2015, about six months before 
graduating, I started looking for a postdoc posi- 
tion and researched a lot of labs. I sent my CV to 
40 or 50 places, along with detailed cover letters. 
And I had a really good publication record. 


But I was not getting any responses. I became 
despondent, fell into a debilitating depression 
and began drinking heavily. It was a black hole. 

The whole time I was applying, my PI 
kept saying: “Why don't you write a postdoc 
fellowship to stay in the lab here with me, in case 
things fall through?” So I did. And my fellow- 
ship application came through with high marks 
and got funded. But I didn't stop looking for 
other postdocs, and began applying to a couple 
of places I didn’t tell my PI about. 

One didnt ask for a recommendation letter, 
and when the people there saw my CV, they 
hired me on the spot. Only later did I discover 
that my PI had been sabotaging my career. 

Academia has structural disadvantages for 
people with mental-health issues. To a great 
extent, your career path is decided by factors 
beyond your control. Confidential letters of rec- 
ommendation are very important in the hiring 
process, particularly in science. You're not going 
to know that someone threw your CV in the 
garbage because someone else said, “Hey, he’s 
depressed”. Those opportunities are lost for ever. 

Unless there is a structural change in the 
system, that bias is not going away. My advice? 
Know your rights. Carefully document any 
instances of abuse or misconduct. And try 
to ensure that a former employer will have 
nothing to use against you that could affect 
your future career. This includes keeping your 
mental-health issues out of the workplace as 
much as possible, even if you have to keep odd 
hours in the lab to work around therapy. 

Learn to carefully balance the needs of your 
job against your personal and mental needs. 
I take a mood stabilizer and can’t drink alcohol 
at all. And I know when I need to leave work. I 
think, “Ive spent enough time here. I think Pll 
swim in the pool.” You have to know your limits. 


ANONYMOUS 
“You’re not alone’ 


Female bioscientist in Europe. 


When I applied to PhD programmes, I was 
accepted by two in the United States. One 
offered me funding and the other didn't. But 
the one that didn’t was more prestigious, and I 
was misled into thinking that I would have no 
trouble getting teaching-assistant positions to 
help pay for the programme. So I decided to go 
there. I made a very big mistake. 

I arrived in the summer and got a position 
as a teaching assistant — but I had to reapply 
for a new one every couple of months. On top 
of that, I got accepted as a research assistant 
in the lab of a famous professor, but he paid 
nothing. I was also taking a full load of courses 
and studying for qualifying exams. 

The first semester was very difficult. I 
couldnt focus. I was always lost. I was always 
sad. I had to drop a course. If only someone 
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at that time had said: “You have depression 
and that’s why you can't focus.” 

The next semester, I took a lighter load. 
I thought it would be easy. But I got a low 
score. I also wrote an internal grant that got 
funded, but I never saw the money. At that 
point, I didn’t want to get up or out of bed. I 
started being absent from the lab. I couldn't 
bring myself to walk to that place. The 
worst part is that I ended up with a C. My 
grade-point average crashed. Then, when 
exams came, I failed miserably. I worried 
about my teaching-assistant position and 
losing that money. I panicked. I was sitting 
there crying and thinking my world had 
fallen apart. 

I called a counsellor and told them that 
I was considering self-harm. There is a law 
in that area that if you try to hurt yourself, 
they put you in the psychiatric hospital. The 
police came in and handcuffed me. I was 
just crying and repeating, over and over, “I 
got a C. I'm going to get kicked out.” In the 
hospital, I was monitored 24/7. After two 
days, I was still crying. 

Ten days after my first hospitalization, I 
was handcuffed again and put in the hospital 
again. Each time I was put in the hospital by 
the police, I was charged more than US$100 
— a huge expense for me. I contacted the 
other PhD pro- 


gramme to see if “Reach out for 
the poe there help. Surround 
wou onour rself wit 

their funding offer peel id ee 
from the previous i i 
year. They said no. show you, utils 


darkness, that 


aaa ie you will make it.” 


of pain medica- 
tion and started 
popping the pills in my mouth. Then I 
stopped and called my psychologist. After a 
few weeks off from my PhD studies, another 
hospitalization and two failed exams, my 
position in the programme was terminated. 

I started looking for other PhD courses 
and found one with a renowned scientist 
in Europe that would pay me enough, so I 
went there. But you can't just say, “It worked 
out in the end.” For a year, I sat and stared 
at the wall. Even though the position is well 
paid, the culture still treats PhDs poorly and 
values unhealthy competition. I am leaving 
academia. 

Tell the darkness not to take you today 
— not this moment, not this second. Take 
time offif you need to. Find the person you 
were before the darkness came. Reach out 
for help — surround yourself with people 
who will show you, in this darkness, that 
you will make it. As isolating and hopeless 
as it might seem right now, you are not the 
only one who has had to pass through this. 
You are not alone. 
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Tell the stories in 
your science 


Use principles of narrative fiction to inspire your 
research papers, says Amanda C. Niehaus. 
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Council fellowship at the University of 

Queensland in Brisbane, Australia, I pub- 
lished papers about sex-crazed marsupials, 
wrote grant applications and finished The 
Breeding Season, an as-yet-unpublished novel. 

Yes, I wrote a novel — fiction founded on my 
science research. The story takes what I know 
about reproductive conflict in wild animals 
and explores it in humans. It’s the hardest and 
most gratifying thing I have ever done. 

I never expected to write about science in 
this way, mainly because I thought science and 
art were mutually exclusive. I learnt to avoid 
bias and passion, and to present my data as if 
seen through a microscope from a careful dis- 
tance. Science is objective. Art, by contrast, is all 
about perspective, subjectivity and the confused 
experience that is life. 

But creative writing can make scientific 
communication more powerful. We remember 
stories because our brains are wired to: we find 
them more interesting and are more likely to 
retell information if it’s presented as narrative 
rather than exposition. In fact, papers written 
ina more-narrative style might be published in 
higher-impact journals and cited more often 
(A. Hillier et al. PLoS ONE 11, e0167983; 2016). 

I’ve spent the past five years collecting data on 
ageing in northern quolls (Dasyurus hallucatus), 
small marsupials whose males essentially copu- 
late themselves to death. Asa scientist, I analyse, 
interpret and present the data with objectivity, 
according to the conventions of my field. 

But I don’t want to put distance between 
myself and my work, or between my work and 


I: the final month of my Australian Research 


the public. I want to show that the questions 
these animals face are universal — for exam- 
ple, how much time and energy to expend in 
making babies, and what that investment means 
for every other aspect of life. These issues define 
the choices we make about when and whether to 
have children, how to balance our careers with 
our families and how to spend our finite lives. 

This is messy stuff. Its complex and com- 
plicated in ways not easily explored in a paper. 

Last year, I published an essay called 
‘Pluripotent’ (A. C. Niehaus Creative 
Nonfiction 64, 22-26; 2017), which exam- 
ines the role of stem cells in the context of my 
experiences as a mother, researcher and cancer 
survivor, but also, metaphorically, as a woman 
in science. Creative writing can bring difficult 
concepts to life. 

For me, compressing science into academic 
journals simply isn’t enough. I’m frustrated by 
the need to reduce my ideas and experiments to 
publishable pieces while simultaneously ensur- 
ing that they are broadly relevant. I believe that 
the most exciting things happen at the fringe, 
the overlap, the moment we look at the same 
question through a different lens altogether. 
New ideas happen outside our comfort zone. 

My research shows that among northern 
quolls, males and females lead different lives 
because of variations in how their bodies 
balance reproduction and longevity. How 
would these differences affect a reproductive 
relationship between humans? Their careers? 
Their ambitions? Isn't this what many of us 
really want to know? 

In writing quoll biology into my novel anda 
short story, I discovered that artists and writers 
seek truth as much as scientists do. They embed 
facts with experiences to give them context and 
meaning. And stories deal not only with what 
is true, but also with what is possible. Through 
fiction, I may discover something about sex and 
death that my research did not tell me. 

Where to start? Take a workshop in creative 
writing, curate an online gallery of inspiring 
images or invite writers to your next sympo- 
sium. Stories are there in every book, movie 
and conversation — so notice them, and 
harness their energy to share your work. m 


Amanda C. Niehaus is a scientist and writer 
in Brisbane, Australia. 
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Ua SCIENCE FICTION 


BLUE SKY RESEARCH 


BY IAN STEWART 


r | Vhe sun beat down like a hammer. 
Stoneseeker scoured the base of the 

cliff, looking for rock samples to take 
back to the elders. It was his first solo explo- 
ration, and his brief was simple: find more 
flint. The chief had relieved him of hunting 
duties, providing food and a waterskin. In 
return, the young man had promised to seek 
out a new source of the dense black flint 
that could be knapped into knives, arrow- 
heads and spearheads, for the old sources 
were running out. 

He tracked speckled seams of dark stones 
as they wove along white cliffs, but the qual- 
ity of the flint started out poor and steadily 
got worse. Eventually the seams ran out alto- 
gether, mainly because the cliffs ran out. He 
could not return to the tribe empty-handed, 
not when it had provided so generously for 
his expedition. He camped for the night, 
making a fire to scare away predators. 

Next morning, he walked across shattered 
rock and gravel, heading for a distant hill. If 
not flint, perhaps there would be obsidian! 
The glassy material was hard to work, but its 
shards were sharp and the edge lasted longer. 

No flint, no obsidian. But he did find an 
outcrop of rock, of a kind he had never seen 
before; almost the colour of the sky, but tinged 
with green. Interesting. Curiosity aroused, he 
put a few lumps in his shoulder-bag. 

Flint, he reminded himself, and walked 
on. Two days later he gave up, and turned 
back to face the tribe's wrath. 


The chief towered over him, splendid in 
eagle-feather robes. Stoneseeker trembled. 
“T failed, Bearkiller? he confessed. “I found 
no flint.” 

“That was not your promise when you 
were relieved of mammoth-hunting duties 
and given food caught by others!” 

I promised to look, not to find, Stoneseeker 
thought. But it was unwise to contradict the 
chief. 

“Don't hunt, dort eat,” Snakebite the 
shaman jeered. 

“T did find these,’ Stoneseeker said hum- 
bly. Bearkiller took the strange bluish stones, 
and passed them to Blademaker. The flint- 
knapper placed one on his thigh-pad and 
struck it with a hammer-stone. It crumbled 
into fragments. 

“Wont take an edge,” he complained. 
“Useless!” 

Stoneseeker, still intrigued by the strange 
stones, gave utterance to a thought that had 
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been growing in his mind. “Bearkiller, I seek 
a boon?” 

“A boon? For failure?” 

“Tam Stoneseeker! I seek stones. I found 
stones. Er — just not flint.” 

“You found wrong stones!” roared 
Bearkiller. “I sent you for flint!” 

“Well, yes... but I found these stones 
instead, great chief, and ’m wondering 
whether it’s worth investigating them. They 
have the colour of the sky. Such marvels 
could have powers undreamt of!” 

Blademaker sneered. “Can your sky- 
stones make a spear? A knife? Even a rudi- 
mentary hide-scraper?” He picked up the 
fragments, let them fall through his fingers. 
“They dont even make serviceable hammer- 
stones!” He spat. “Useless!” 

“How do you plan to investigate the 
stones?” Bearkiller asked quietly. 

“Uh — [havent really got that far, great 
chief. Drop them in water? Throw them at 
hares? Leave one lying in the moonlight? 
Rub them with —” 

“And what do you propose to find?” 

“Thave no idea.” 

“Then what use are they?” 

Stoneseeker sought a helpful analogy. 
“Great chief: what use is a newborn baby?” 

“Not my department. Call the midwife.” 

“Dont listen to the idiot!” Mammoth- 
stabber yelled. “He’s just trying to con us 
into feeding him while he weasels out of the 
hunt!” 

“Dont hunt, don’t eat,” the shaman 
repeated. 

Bearkiller stroked his impressive beard 
thoughtfully. “Half a day’s walk from here 
is a tar-pit? he said enigmatically. “What do 
we find there, Stoneseeker?” 

“Lions,” Stoneseeker mumbled. He could 
see where this was going. “Dead ones.’ 

“And what made them walk into the tar 
to die?” 


The real answer was trapped prey, for 
food, but what the chief expected was the 
conventional folk-wisdom. 

“Curiosity,” Stoneseeker whispered. 

“Curiosity killed the cat!” the shaman 
declaimed. 

“You mean well, youngster,” said the 
chief. “But the tribe can’t squander 
precious food to let you waste good 
hunting-time, merely to satisfy your 

curiosity.” He raised his eyes to the heav- 
ens. “We need practical ideas. As well as try 
to find out why your stones are the colour 
of the sky!” The other elders howled with 
laughter. “You can’t tell us what you will 
investigate. You can't tell us what you will 
find. You don't know what you're doing.” 

If I knew beforehand what Id discover, 
thered be no point in trying to discover it, 
Stoneseeker thought, but remained silent. 


Stoneseeker showed the sky-stones to 
Foxfoot, who made pots. “Heat does strange 
things to colours,” she mused. “If I grind 
some of these they might just work as glaze. 
Or the heat might make the stones stronger. 
I’ve built a new kiln, the hottest yet! I'll heat 
them overnight” 

Next morning, when Foxfoot looked 
for the stones in the kiln, there was noth- 
ing but grey ash. Raking it out, something 
strange caught her eye. She picked it up and 
wiped away the ash. It was a lumpy tangle of 
reddish material. Some bits glistened in 
the sunlight. 

They took it to show Bearkiller. 

“T told you not to investigate further.” 

“We did it in our spare time, great chief.” 

“People who eat at the tribe’s expense 
but produce nothing should not have spare 
time.” He glared at Stoneseeker. “You're 
going to ask me to reverse my decision, 
arent you? Just because you've found some- 
thing you don’t understand” He turned 
the irregular mass of — something — in his 
hand. “T'll ask the expert. Blademaker!” 

The flint-knapper rushed over. 

“What do you think of this stuff?” 

Blademaker took the strange lumpy mat- 
erial and grunted unhappily. It bent easily in 
his hands. “Too soft. It will never replace the 
flint spearhead! Useless!” 

Bearkiller nodded. And threw it away. = 


Ian Stewart'’s latest SF novel is Rock Star. 
He is sole winner of the Bloody Stupid 
Johnson award for Innovative Uses of 
Mathematics, and an honorary wizard of 
Unseen University. 
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ROBOTICS 


A gripping problem 


Designing machines that can grasp and manipulate objects with anything 
approaching human levels of dexterity is first on the to-do list for robotics. 


n the 1973 science-fiction classic 
Westworld, the robots that staff the film’s 
fictitious holiday resort are indistinguish- 
able from their human guests, except for one 
small clue: the engineers, we are told, haven't 
perfected the hands yet. 
The capabilities of real-world robots fall 
a long way short of those of Westworld’s 
murderous hosts, but on this small point, 
reality and fiction are in agreement: hands, and 
the manipulation of objects, are particularly 


challenging aspects of robotics. “Grasping is 
the critical grand challenge right now,” says 
Ken Goldberg, an engineer at the University 
of California, Berkeley. 

In the past 50 years, robots have become 
very good at working in tightly controlled 
conditions, such as on car-assembly lines. 
“You can build a robotic system for one specific 
task — picking up a car part, for example,” 
says Juxi Leitner, a robotics researcher at 
the Australian Centre for Robotic Vision 


(ACRV), based at the Queensland University 
of Technology in Brisbane. “You know exactly 
where the part is going to be and where the arm 
needs to be,” he says, because the robot picked 
up the same thing from the same place “the last 
million times”. 

But the world is not a predictable assembly 
line. Although humans might find interacting 
with the countless objects and environments 
found beyond the factory gates a trivial 
task, it is ttemendously difficult for robots. 
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ROBOTICS 


>» These unstructured environments 
represent the next frontier for researchers across 
the field of robotics, but such environments are 
particularly problematic for robots that grip. 
Any robot hoping to interact physically with the 
outside world faces an inherent uncertainty in 
how objects will react to touch. “We can predict 
the motion of an asteroid a million miles away 
far better than we can predict the motion of a 
simple object being pushed across the table,” 
Goldberg says. 

Some researchers are using machine learning 
to empower robots to independently identify 
and work out how to grab objects. Others are 
improving the hardware, with grippers ranging 
from pincer-like appendages to human-like 
hands. And roboticists are also gearing up to 
tackle the challenge of manipulating objects 
gripped in the hand. 

Advances in robots’ ability to handle 
objects could have enormous societal impact. 
Commercial entities, particularly those 
involved in the movement of varied goods, are 
following developments closely. “There's a big 
demand. Industry really wants to address this 
because of how fast e-commerce is growing,” 
says Goldberg. With interest greater now than 
ever, “It’s an opportunity for the research to 
really be put into practice.” 


LEARNING TO LEARN 

The heightened industry interest is exemplified 
by an annual competition organized by 
e-commerce giant Amazon for the past three 
years. The Amazon Robotics Challenge asks 
teams of researchers to design and build a 
robot that can sort the items for a customer's 
order from containers and place them together 
in boxes. The items are varied, ranging from 
bottles and bowls to soft toys and sponges, 
and are initially jumbled together, which 
makes it a difficult task in terms of both object 
identification and mechanical grasping. 

In July 2017, Leitner’s ACRV team claimed 
victory with a robot called Cartman, which 
resembles a fairground ‘claw’ game. An 
aluminium frame supports the claw assembly. 
The robot has two tools for picking up objects, 
known as end effectors: a gripper with two 
parallel plates, and a suction cup backed by 
a vacuum pump. For each object the robot 
encounters, the researchers specify which 
effector it should try first. If that doesn’t work, 
the robot switches tools. 

First, however, the robot must find the 
item it’s looking for. The team tackled this 
challenge by using machine learning. The 
main input for the robot is an RGB-D cam- 
era, a technology that is popular among 
roboticists and that can assess both colour 
and depth. The camera looks down from 
the effector into the boxes below. From this 
vantage point, Leitner explains, Cartman 
labels each pixel according to the object it 
belongs to — a form of deep learning known 
as semantic segmentation. Once a cluster of 
pixels representing the desired object is found, 
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the camera's depth-sensing capability helps the 
robot to work out how to grab the item. “In 
simple terms, we attach to the bit that pokes 
out the most,” says Leitner. 

Rapid advances in machine learning 
underpin a lot of recent progress in grasp- 
ing. “Software has been the bottleneck for 
ages, but it’s becoming more advanced 
thanks to deep learning,” says Pieter Abbeel, 
a deep-learning specialist at the University 
of California, Berkeley. These developments 
have, he says, opened up “whole new avenues 
of robotics applications”. 

Abbeel is co-founder and chief scientist 
of covariant.ai, a start-up in Emeryville, 
California, that uses deep learning to train 
robots. Rather than program a robot to 
perform a specific action, humans provide 
demonstrations that the robot can then adapt 
to deal with variations of the same problem. 


“GRASPING 


IS THE 


CRITICAL 


GRAND CHALLENGE 
RIGHT NOW.” 


The human trainer views the camera feed 
of the robot arm through a headset, and uses 
motion controls to guide the robot arm to pick 
up objects. The process feeds a neural network 
with data on the approach taken. “With just 
a few hundred demonstrations done in this 
particular way, you can train a deep neural 
network to acquire a skill? says Abbeel. “And 
I don’t mean acquire a specific motion that 
it’s going to repeatedly execute, but acquire 
the ability to adapt the motion to whatever it’s 
seeing in its camera feed” 

Goldberg uses machine learning to teach 
robots to grasp, too. But rather than gather 
data from real-world attempts, his Dex-Net 
software is trained virtually. “We can simu- 
late millions of grasps very quickly,’ he says. 
The software lets an industrial robot pick 
objects from a pile with a success rate of more 
than 90%, even if it hasn’t seen those objects 
before. It can also decide for itself whether to 
use a parallel-jaw gripper or suction tool fora 
particular object. 

Dex-Net’s fourth incarnation will be 
presented in 2018. According to a metric 
being developed by Goldberg and roboticists 
around the world to aid reproducibility, known 
as mean picks per hour, the Dex-Net system is 
now among the fastest pickers around. It can 
achieve over 200 picks an hour — still behind 
human capacity, estimated at 400-600 picks an 
hour, but far ahead of the numbers achieved by 
the teams at the most recent Amazon Robotics 
Challenge (see ‘A measure of success’). 


But Dex-Net’s simulated world is imperfect. 
The model assumes that objects are rigid, for 
instance, and does not account for objects that 
contain liquid. Simulation, Abbeel says, might 
not always be the easiest way to learn. “The real 
world provides a simulator for free;” he says. 
“It's great to leverage both ways of doing it” 


SOFTLY DOES IT 

The combination of a parallel-jaw gripper 
and suction employed by both Goldberg and 
Leitner is a popular choice: most teams at the 
2017 Amazon Robotics Challenge took this 
hybrid approach. But how these tools grasp 
objects — planning contact points before 
moving into position as precisely as possi- 
ble — is very different to how we humans use 
our hands. 

“When you pick something like a pen up off 
a table, the first thing you touch is the table,” 
says Oliver Brock, a roboticist at the Technical 
University of Berlin. We do not think about 
where we need to place our fingers. The soft- 
ness of human hands allows for something 
called compliant contact — the fingers mould 
against the surface of the object. “Because you 
get a lot of surface contact, you can much more 
intuitively reach out and grab,’ says Daniela 
Rus, a roboticist at the Massachusetts Institute 
of Technology in Cambridge. “With soft 
fingers, we change the paradigm of grasping.” 

Many are seeking to exploit the benefits of 
compliance in grasping by building softness 
into robot grippers. Brock’s lab has developed 
the RBO Hand 2, a human-like hand with five 
silicone fingers. The fingers are controlled by 
the movement of pressurized air, which allows 
them to curl and straighten as required. 

Although the human-like arrangement of 
the fingers might not be suited to every task, 
it is ideal for interacting with the world we 
inhabit. “The world is designed for anthropo- 
morphic hands,” says Brock. But there's also 
a romantic element to the anthropomorphic 
design of his robotic hand. “It’s embarrassing,” 
he says. “People, even roboticists, are more fas- 
cinated by things that look human.” 

The benefits of softness are already drawing 
commercial interest. Soft Robotics, in Cam- 
bridge, Massachusetts, produces air-actuated 
grippers that have a more claw-like design than 
Brock’s research model. The robots are already 
being trialled in a factory setting, handling 
delicate produce without damaging it. 

Another start-up, Righthand Robotics in 
Somerville, Massachusetts, is adding soft- 
ness to the claw and suction set-up so popular 
with roboticists. Its claws have three flexible 
fingers, arranged around a central suction cup 
that can be extended to draw in objects. The 
design takes inspiration from birds of prey, 
says the company’s co-founder Lael Odhner. In 
these birds, most of the forearm musculature 
is attached to a single group of tendons that 
reaches to the tip of the claw, he says. Similarly, 
all the power of the motors in Odhner’s robotic 
claws is put into a single closing motion. This 
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A MEASURE OF SUCCESS 


ROBOTICS 


To make it easier to compare experimental results in robotic grasping, Ken Goldberg, an engineer at the 
University of California, Berkeley, and others are preparing a metric known as mean picks per hour that 
could be used as a standard measure across the field. By this metric, the Dex-Net system seems to be 
considerably more capable than the robots that competed in the 2017 Amazon Robotics Challenge. 
They are all, however, still far behind humans, who can manage 400-600 picks per hour. 
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simple action improves reliability — a crucial 
consideration for grippers intended for com- 
mercial use — at the expense of the ability to 
perform delicate motions. The extendable 
suction cup makes up for this shortfall. “It 
replaces potentially dozens of fine actuators 
that you would otherwise have to place in the 
hand,” says Odhner. 


IN OUR HANDS 

Softness is still relatively new to robotics. “It’s 
an extraordinarily powerful concept, but peo- 
ple are just beginning to figure it out,” says Rus. 

A common criticism is that it’s hard for 
a soft robotic hand to perform a useful 
action with a grasped object. “You grab it 
very well,” explains Rus, but “you don't know 
exactly the orientation of the object inside 
the hand”. This makes it tricky to manipulate 
the object. Goldberg echoes this: “If you just 
have a soft, enveloping type of hand, then 
you've really reduced your visibility of that 
object,’ he says. 

The way people solve this is simple: we 
touch. But few robots have been granted this 
capability. “People agree that it is important,” 
says Brock — it’s just very difficult to do. 
He is pursuing two approaches to giving his 
soft robot hands a sense of touch. The more- 
mature method involves embedding tubes of 
liquid metal in a silicone sheet wrapped around 
the finger. His team can then monitor applied 
forces using the electrical resistance along the 
tubes. “It’s measuring strain all over the fin- 
ger and inferring from that, through machine 
learning, what actually happens to the finger” 
The team is currently assessing how many of 
these strain sensors are required on each finger 
to measure various forces. 

Brock’s other approach to incorporating 
touch centres on acoustics. In a proof-of- 
concept test, a microphone was placed inside 
the air chamber ofa soft finger. The sound that 
was recorded enabled researchers to identify 
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which part of the finger was touching some- 
thing, the force of the touch and the material of 
the object. The ability to tuck the microphone 
deep inside the finger — and therefore avoid 
reducing compliance — sidesteps a key issue 
with sensorizing soft fingers. Brock says that 
the details of this work will be published soon, 
and that he plans to work with acoustics spe- 
cialists to improve his design. 

Many roboticists think that there is unlikely 
to be a universal solution to grasping. Even if 
robots could achieve human levels of dexter- 
ity, Rus points out, “there are lots of things that 
we cannot pick up with a human hand.” But as 
robots become increasingly adept at handling 
variability, more tasks currently performed by 
humans will become automatable. 

A report published last year estimates that 
most occupations could be at least partially 
automated (go.nature.com/2rjtnud). But Gold- 
berg stresses that robotics does not have to put 
people out of work. “My goal is not to replace 
humans,’ he says. “What I want to do is assist 
humans.” 

Whatever the outcome, a lot of development 
needs to be done before any robot revolution 
can take place. Leitner’s team might have won 
Amazon's challenge last year, but its robotic 
arm fell to bits on the first day of competition, 
and most teams experienced technical issues 
of some kind or another. “These systems are 
not super robust yet,’ Leitner says. “If you were 
to take that system and put it into an Amazon 
warehouse, I’m not sure how long it would 
actually work for.” 

“Tm someone who is exploring a new 
world,’ says Brock, “and I’m not ready yet to 
spring in to the limelight and make industrial 
applications left and right.” But, as Goldberg 
says, “there’s no barrier to putting these into 
practice. It’s already starting to happen.” = 


Richard Hodson is supplements editor for 
Nature. 
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AUSTRALIA'S BIOTECHNOLOGY SECTOR 


Changing landscape 


Tax breaks have boosted Australian biotechnology and fuelled overseas investment, 
but industry insiders fear that threatened budget cuts could slow growth. 


n 1999, an Australian federal government 

briefing paper on biotechnology in the 

country concluded that the sector “hardly 
rates as an economic force” because of its small 
size and the financial challenges that it faced in 
getting products to market. 

Now, barely two decades later, Australia has 
ranked in the top five globally for biotechnol- 
ogy three years running, outperforming nations 
such as the United Kingdom and Germany, 
according to Scientific American Worldview. 
The medical-biotechnology sector has benefited 
from a multibillion-dollar windfall of govern- 
ment funding, as well as from substantial tax 
breaks for companies investing in research and 
development. 

A run of high-profile international sales 
and licensing deals has attracted considerable 
interest from multinational pharmaceutical 
companies looking to invest in biotechnology. 


And although the medical sector garners 
the most headlines and funding, Australian 
agricultural, environmental and industrial 
biotechnologies are making their debut 
on the world stage. But despite the enthu- 
siasm, there are concerns that Australian 
biotechnology faces a major threat from the 
federal government's planned overhaul of a 
tax-incentive scheme that many argue has 
delivered the industry's biggest boost. 

Dean Moss, who leads UniQuest — the 
commercialization arm of the University of 
Queensland in Brisbane — highlights the sec- 
tor’s buoyancy, adding: “You've got a very, very 
healthy environment, you've got a load of state 
initiatives for medical devices and therapeutics, 
so there’ a real alignment of the planets to make 
it easier to access capital to develop’ 

Scientific-research funding has been severely 
cut in recent years — falling to just below 0.6% 


of gross domestic product in 2014-15. But 
biotechnology has received a lot of cash from 
the government. First, there was the Medical 
Research Future Fund, announced in May 2014; 
this was an Aus$20-billion federal-government 
fund (worth about US$18 billion at the time) to 
support medical research and innovation. Then, 
in 2016, came the Biomedical Translation Fund, 
an $500-million scheme jointly supported 
by federal government and private coffers to 
“translate good science and research effort into 
real-life healthcare solutions” 

And the Research and Development (R&D) 
Tax Incentive has been providing refundable 
tax offsets since July 2011. Glenn Cross, chief 
executive of AusBiotech in Melbourne — the 
sector's industry body — says that not only is 
the top end of the biotechnology sector look- 
ing extremely healthy (see ‘Deal makers’), but 
there's plenty more activity further down. 
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SOUS AUSTRALIAS BIOTECHNOLOGY SECTOR 


» “At the other end of the market, we’re 
continuing to see new technologies commer- 
cialized and investment coming into the sector,’ 
Cross says. As well as a steady stream of bio- 
technology companies listing on the Australian 
stock exchange, Cross says, the R&D Tax 
Incentive scheme is encouraging international 
companies to see Australia as a destination for 
research and development. 

Companies with a turnover of less than 
$20 million a year may be able to get 43.5% 
of their annual R&D spend refunded in cash; 
companies with higher turnovers could be 
eligible for a 38.5% tax offset. In 2013-14, there 
were nearly 12,000 registrations for the incen- 
tive scheme. Overall, the federal government 
refunded around $2.95 billion for an R&D 
spend by those companies of $19.5 billion. 

But two reports have questioned the scheme’s 
effectiveness in boosting research and develop- 
ment. The first, presented to Australia’s prime 
minister, Malcolm Turnbull, in April 2016, 
concluded that the growth of the tax incen- 
tive was jeopardizing the scheme'’s long-term 
sustainability (see go.nature.com/2rj9z1s); it 
proposed capping a company’s annual refund 
at $2 million, to keep costs down. Its authors 
argued that this was unlikely to harm small and 
medium-sized enterprises, which were already 
gaining a premium refund because of their 
lower turnover, nor would it significantly affect 
larger companies, which had greater access to 
alternative sources of funding. 

The second report, published last November, 
recommended an annual cap of $4 million per 
company, with a maximum cumulative refund 
of $40 million per company (see go.nature. 
com/2rjfgyh). The report, produced by Inno- 
vation and Science Australia, an independent 
advisory board, also suggested that the tax 
incentive should kickin only once a company’s 
R&D spending exceeds 1% of its total annual 
expenditure. 

The biotechnology sector is unenthusiastic 
about the capping proposals. “The biggest con- 
cern we have is that they would make changes to 
the R&D Tax Incentive that would stop invest- 
ment coming in internationally,” says Cross. 
They could also result in less research and fewer 
clinical trials being conducted in Australia. 

Federal Treasurer Scott Morrison warned 
that the incentive will undergo big changes in 
the May 2018 budget, saying in an April speech 
that the scheme “has been taken for a ride by 
some and integrity needs to be restored.” 

International interest and investment is vital 
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for the growth of Australian biotechnology. 
Sue MacLeman, managing director and chief 
executive of MTPConnect in Melbourne — a 
not-for-profit, government-funded organiza- 
tion to support the medical-technology and 
pharmaceutical sectors — says that, because 
the sector is relatively small on a global scale, 
Australia is unlikely to develop its own big 
multinational pharmaceutical or biotechnol- 
ogy companies. Rather, it will fit into the bigger 
picture, she thinks, as a market of high-quality 
biotech start-ups for international buyers. 


TALENT SPOTTING 

“We're seeing an increase in the number of 
scouts on the ground from those pharma 
companies, clearly looking for these sorts 
of [biotechnology] science,” MacLeman 
says. She adds that Australian biotechology 
companies are increasingly offering tech- 
nology that has already made it to phase I or 
phase II clinical trials. This makes a start-up or 
spin-off much more attractive to international 
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pharmaceutical companies — and that means a 
bigger return on investment for the Australian 
company once it is bought, MacLeman says. 

Australian medical biotechnology is probably 
most famous for a cervical-cancer vaccine 
developed by Ian Frazer and Jian Zhou at the 
University of Queensland in the 1990s, but the 
sector is now making waves internationally ina 
wide variety of fields. 

“We're very competitive in oncology, 
infectious diseases, things like vaccines, 
respiratory, neurology, ophthalmology [and] 
we see good progress in regenerative medi- 
cine,” MacLeman says, adding that Australia’s 
clinical-trial capability is part of its biotechnol- 
ogy success story. A collaborative network of 
clinical-trial units extends across the country, 
and states such as Victoria are courting interna- 
tional pharmaceutical companies, urging them 
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to bring their early-stage trials to Australia. 

“The fact that we're being asked to do those 
initial clinical trials here in Australia means that 
we have the frameworks in place. We have the 
skill and expertise in place to actually look at 
some of these technologies,’ says MacLeman. 

“You've got world-class infrastructure and 
good clinicians, so you're getting good data 
under independent ethics review and consent, 
so that the data has integrity and security and 
you can make meaningful investment deci- 
sions about the continued development of those 
programmes.” 

MacLeman says that approval for clinical 
trials through the Australian Clinical Trial 
Notification scheme can take as little as six 
weeks, compared with about a year for an appli- 
cation submitted in the United States under 
the Investigational New Drug programme. 

But things are less streamlined for drugs 
from biological sources, says Melissa Little, 
programme leader of Stem Cells Australia — a 
national network of stem-cell researchers and 
institutions. Little, based in Melbourne, says 
that “for a clinical trial where you've got to do 
cellular production, even for a phase 1, it's easier 
to go offsite”. That's because the Therapeutic 
Goods Administration, Australia’s regulatory 
body for therapeutic products, is “much smaller 
and much less experienced with biologics” than 
is the US Food and Drug Administration. 

The TGA also has a loophole in its 
regulation that allows a registered clinician 
to administer a stem-cell treatment using a 
patient’s own cells — even if that treatment is 
unproven — without going through any regu- 
latory checks. Organizations such as Stem 
Cells Australia are lobbying hard to have the 
issue dealt with, and Little says that the TGA is 
looking at how to do this. 

The regulator is well respected at home and 
abroad, but it has been criticized for taking too 
long to approve medicines and medical devices, 
compared with its overseas counterparts — a 
concern that was highlighted in a 2015 review 
commissioned by the government. 

Since then, the TGA has set up a programme 
called SME Assist, to help small and medium- 
sized enterprises navigate more easily through 
the registration process. 


BRANCHING OUT 

Although 69% of Australian biotechnol- 
ogy takes place in the health and biomedical 
arena, the country is also emerging as a global 
player in agricultural, environmental and 
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industrial biotechnology. Michael Christie, 
a patent attorney in Sydney who chairs the 
New South Wales division of the Ag Institute 
Australia — the industry body for agricultural 
and natural-resource management profes- 
sionals — says that the country is particularly 
strong on basic and applied research in agri- 
cultural biotechnology. But he adds that the 
take-up is more patchy. For one thing, South 
Australia, Tasmania and the Australian Capital 
Territory all have moratoriums on the commer- 
cial cultivation of genetically modified crops. 

Christie says that the regulatory framework 
for genetically modified crops is also a barrier 
to adoption. “Generally speaking, the scheme 
is quite complicated and the procedures for 
obtaining regulatory approval are unduly 
cumbersome,’ he says. But he also stresses that 
the framework helps to maintain Australia’s 
reputation as a “clean and green” source of 
agricultural products. 

Although it has not received the same 
funding and investment as medical bio- 
technology, Christie says that agricultural 
biotechnology is becoming more ofa topic of 
conversation among investors. Much of the 
research in the field is done within the Com- 
monwealth Scientific and Industrial Research 
Organisation (CSIRO), Australia’s national 
science agency. But in recent years, the forma- 
tion of primary industry-focused research and 
development corporations, such as AgriFutures 
Australia, has widened the field. 

“These groups have placed a stronger 
emphasis on commercial outcomes, so we're 
seeing more applied-research programs as 
well as collaborations being formed between 
publicly funded research organizations and 
industry,’ Christie says. 

CSIRO is also a hotbed of environmental and 
industrial biotechnology — particularly in rela- 
tion to mining, an area in which Australia is a 
key global player — says Anna Kaksonen, who 
leads the biotechnology and synthetic-biology 
group at CSIRO in Perth. 

Mining biotechnology uses microbes 
“to extract metals from low-grade ores and 
also metal-containing waste material such 
as e-waste, battery waste, mining waste,” 
Kaksonen explains. It is also being used to clean 
up polluted sites, such as old mines. “There are 
alot of legacy sites in Australia, so there is a lot 
of work in that space where microbes could be 
used to remove sulfate metals and acidity from 
the water,” she says. 

Many of the microbes used in this work are 
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SUDA, a company that 
specializes in oro-mucosal 
drug delivery, signs deal 
worth up to $34 million with 
Chinese pharmaceutical 
company Eddingpharm for 
an insomnia therapy. 
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The Walter and Eliza Hall 
Institute in Melbourne sells a 
royalty stake in its anti-cancer 
treatment venetoclax to a 
subsidiary of the Canada 
Pension Plan Investment 
Board for up to $424 million. 


found in nature, but researchers are looking at 
ways of engineering them to better withstand 
the their contaminated environments. 


SKILLS GAP 

Australia has no shortage of ideas in biotech- 
nology, but it is struggling to find people with 
relevant commercial experience, says Cross, 
particularly at the most senior leadership levels. 

“We're certainly getting a lot more of our 
senior executives who have gone overseas and 
worked in big pharma companies or big med- 
tech companies coming back to Australia,” 
he says. “But the ability to satisfy all of the 
senior executive-level jobs that are coming up 
in the industry — it’s difficult to do just from 
Australia.” 

To address this, the Australian government 
announced in March this year a pilot of a new 
version of the entrepreneur visa. Unlike the 
current visa, this one does not require entre- 
preneurs to have capital backing. Ministers 
also launched a Global Talent Scheme, offer- 
ing four-year Temporary Skill Shortage visas to 
science, engineering or medicine start-ups, as 
well as to established businesses seeking people 
for positions with salaries above $180,000. 

The relatively small size of Australia’s private 
biotechnology sector — the national Stock 
Exchange lists 140 life-sciences companies — 
can also make it difficult for graduates to find 
work outside academia and research institu- 
tions. Andrew Webb, acting division head for 
systems biology and personalized medicine at 
the Walter and Eliza Hall Institute of Medical 
Research in Melbourne, says that even positions 
in academic research are “few and far between” 
in Australia. 

“I don't know where postdocs are going, 
but there's certainly not a huge amount of jobs 
available, certainly in the academic sector.” 

Australia’s commercial biotechnology sector 
is growing fast, however, and hotspots such as 
Melbourne — which hosts 27 leading biotech- 
nology-research institutes — are gaining ground 
internationally. In terms of supporting and nur- 
turing biotechnology start-ups and spin-offs, 
Webb says that Melbourne is approaching 
the standard of Boston, San Francisco and 
Cambridge, Massachusetts. 

Webb says that, despite Australia’s difficulties 
in commercializing its research, the speed of 
development makes biotechnology an excit- 
ing space to be in. “Everything is advancing so 
rapidly that there are enormous opportunities 
in this space? = 
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Pharma group MSD pays 
around $502 million to 
acquire Viralytics —a 
spin-off from the University 
of Newcastle focused 

on the cancer-busting 
coxsackievirus A21. 
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